HBM VARIANTS THAT MODULATE BONE MASS AND LIPID LEVELS

FIELD OF THE INVENTION
The present invention relates generally to the field of genetics, genomics and 
molecular biology. The invention relates to methods and materials used to isolate, detect and 
sequence a high bone mass gene and corresponding wild-type gene, and mutants thereof. The 
present invention also relates to the high bone mass (HBM) gene, the corresponding wild- 
type gene, and mutants thereof. The genes identified in the present invention are implicated 
in the ontology and physiology of bone development. The invention also provides nucleic 
acids, proteins, cloning vectors, expression vectors, transformed hosts, methods of 
developing pharmaceutical compositions, methods of identifying molecules involved in bone 
development, and methods of diagnosing and treating diseases involved in bone development 
and lipid levels. The invention further relates to transgenic animals for studying the HBM 
phenotype and related variant phenotypes, the mechanism of action of the HBM gene and its 
variants, and factors and treatments affecting normal and abnormal bone conditions. In 
preferred embodiments, the present invention is directed to methods for treating, diagnosing, 
preventing and screening for normal and abnormal conditions of bone, including metabolic 
bone diseases such as osteoporosis. 

BACKGROUND OF THE INVENTION
Two of the most common types of osteoporosis are postmenopausal and senile 
osteoporosis. Osteoporosis affects men as well as women, and, taken with other 
abnormalities of bone, presents an ever-increasing health risk for an aging population. The 
most common type of osteoporosis is that associated with menopause. Most women lose 
between 20-60% of the bone mass in the trabecular compartment of the bone within 3-6 years 
after the cessation of menses. This rapid loss is generally associated with an increase of bone 
resorption and formation. However, the resorptive cycle is more dominant and the result is a 
net loss of bone mass. Osteoporosis is a common and serious disease among postmenopausal 
women. There are an estimated 25 million women in the United States alone who are 
afflicted with this disease. The results of osteoporosis are both personally harmful and also
account for a large economic loss due to its chronicity and the need for extensive and
long-term support (hospitalization and nursing home care) from the disease sequelae. This is
especially true in more elderly patients. Additionally, while osteoporosis is generally not
thought of as a life-threatening condition, a 20-30% mortality rate is related to hip fractures in
elderly women. A large percentage of this mortality rate can be directly associated with
postmenopausal osteoporosis. The cost associated with the treatment of osteoporotic
fractures in the United States alone is $10 to $15 billion annually. Worldwide incidence of
osteoporotic hip fractures is estimated to exceed 1.7 million cases.

The most vulnerable tissue in the bone to the effects of postmenopausal osteoporosis
is the trabecular bone. This tissue is often referred to as spongy bone and is particularly
concentrated near the ends of the bone near the joints and in the vertebrae of the spine. The
trabecular tissue is characterized by small structures which inter-connect with each other as
well as the more solid and dense cortical tissue which makes up the outer surface and central
shaft of the bone. This crisscross network of trabeculae gives lateral support to the outer
cortical structure and is critical to the biomechanical strength of the overall structure. In
postmenopausal osteoporosis, it is primarily the net resorption and loss of the trabeculae
which lead to the failure and fracture of the bone. In light of the loss of the trabeculae in
postmenopausal women, it is not surprising that the most common fractures are those
associated with bones which are highly dependent on trabecular support, e.g., the vertebrae,
the neck of the femur, and the forearm. Indeed, hip fracture, Colles' fractures, and vertebral
crush fractures are indicative of postmenopausal osteoporosis.

One of the earliest generally accepted methods for treatment of postmenopausal
osteoporosis was estrogen replacement therapy. Although this therapy frequently is
successful, patient compliance is low, primarily due to the undesirable side-effects of chronic
estrogen treatment. Frequently cited side-effects of estrogen replacement therapy include
reinitiation of menses, bloating, depression, and fear of breast or uterine cancer. In order to
limit the known threat of uterine cancer in those women who have not undergone a
hysterectomy, a protocol of estrogen and progestin cyclic therapy is often employed. This
protocol is similar to that which is used in birth control regimens, and often is not tolerated
by women because of the side-effects characteristic of progestin. More recently, certain
antiestrogens, originally developed for the treatment of breast cancer, have been shown in
experimental models of postmenopausal osteoporosis to be efficacious. Among these agents
is raloxifene (See, U.S. Patent No. 5,393,763, and Black et al., J. Clin. Invest., 93:63-69
(1994)). In addition, tamoxifen, a widely used clinical agent for the treatment of breast
cancer, has been shown to increase bone mineral density in postmenopausal women
suffering from breast cancer (Love et al., N. Engl. J. Med., 326:852-856 (1992)).

Another therapy for the treatment of postmenopausal osteoporosis is the use of
calcitonin. Calcitonin is a naturally occurring peptide which inhibits bone resorption and has
been approved for this use in many countries (Overgaard et al., Br. Med. J., 305:556-561
(1992)). The use of calcitonin has been somewhat limited, however. Its effects are very
modest in increasing bone mineral density and the treatment is very expensive. Another
therapy for the treatment of postmenopausal osteoporosis is the use of bis-phosphonates.
These compounds were originally developed for use in Paget's disease and malignant
hypercalcemia. They have been shown to inhibit bone resorption. Alendronate, one
compound of this class, has been approved for the treatment of postmenopausal osteoporosis.
These agents may be helpful in the treatment of osteoporosis, but they also have
potential liabilities which include osteomalacia, extremely long half-life in bone (greater than
2 years), and possible "frozen bone syndrome," i.e., the cessation of normal bone
remodeling.

Senile osteoporosis is similar to postmenopausal osteoporosis in that it is marked by
the loss of bone mineral density and resulting increase in fracture rate, morbidity, and
associated mortality. Generally, it occurs in later life, i.e., after 70 years of age. Historically,
senile osteoporosis has been more common in females, but with the advent of a more elderly
male population, this disease is becoming a major factor in the health of both sexes. It is not
clear what, if any, role hormones such as testosterone or estrogen have in this disease, and its
etiology remains obscure. Treatment of this disease has not been very satisfactory. Hormone
therapy, estrogen in women and testosterone in men, has shown equivocal results; calcitonin
and bis-phosphonates may be of some utility.

The peak mass of the skeleton at maturity is largely under genetic control. Twin
studies have shown that the variance in bone mass between adult monozygotic twins is
smaller than between dizygotic twins (Slemenda et al., J. Bone Miner. Res., 6:561-567 (1991);
Young et al., J. Bone Miner. Res., 6:561-567 (1995); Pocock et al., J. Clin. Invest., 80:706-710
(1987); Kelly et al., J. Bone Miner. Res., 8:11-17 (1993)), and it has been estimated that up to
60% or more of the variance in skeletal mass is inherited (Krall et al., J. Bone Miner. Res.,
10:S367 (1993)). Peak skeletal mass is the most powerful determinant of bone mass in
elderly years (Hui et al., Ann. Int. Med., 111:355-361 (1989)), even though the rate of
age-related bone loss in adult and later life is also a strong determinant (Hui et al., Osteoporosis
Int., 1:30-34 (1995)). Since bone mass is the principal measurable determinant of fracture
risk, the inherited peak skeletal mass achieved at maturity is an important determinant of an
individual's risk of fracture later in life. Thus, study of the genetic basis of bone mass is of
considerable interest in the etiology of fractures due to osteoporosis.

Recently, a strong interest in the genetic control of peak bone mass has developed in
the field of osteoporosis. The interest has focused mainly on candidate genes with suitable
polymorphisms to test for association with variation in bone mass within the normal range, or
has focused on examination of genes and gene loci associated with low bone mass in the
range found in patients with osteoporosis. The vitamin D receptor locus (VDR) (Morrison et
al., Nature, 367:284-287 (1994)), PTH gene (Howard et al., J. Clin. Endocrinol. Metab.,
80:2800-2805 (1995); Johnson et al., J. Bone Miner. Res., 8:11-17 (1995); Gong et al., J. Bone
Miner. Res., 10:S462 (1995)) and the estrogen receptor gene (Hosoi et al., J. Bone Miner.
Res., 10:S170 (1995); Morrison et al., Nature, 367:284-287 (1994)) have figured most
prominently in this work. These studies are difficult because bone mass (the phenotype) is a
continuous, quantitative, polygenic trait, and is confounded by environmental factors such as
nutrition, co-morbid disease, age, physical activity, and other factors. Also, this type of study
design requires large numbers of subjects. In particular, the results of VDR studies to date
have been confusing and contradictory (Garnero et al., J. Bone Miner. Res., 10:1283-1288
(1995); Eisman et al., J. Bone Miner. Res., 10:1289-1293 (1995); Peacock, J. Bone Miner.
Res., 10:1294-1297 (1995)). Furthermore, the work thus far has not shed much light on the
mechanism(s) whereby the genetic influences might exert their effect on bone mass.

While it is well known that peak bone mass is largely determined by genetic rather
than environmental factors, studies to determine the gene loci (and ultimately the genes)
linked to variation in bone mass are difficult and expensive. Study designs which utilize the
power of linkage analysis, e.g., sib-pair or extended family, are generally more informative
than simple association studies, although the latter do have value. However, genetic linkage
studies involving bone mass are hampered by two major problems. The first problem is the
phenotype, as discussed briefly above. Bone mass is a continuous, quantitative trait, and
establishing a discrete phenotype is difficult. Each anatomical site for measurement may be
influenced by several genes, many of which may be different from site to site. The second
problem is the age component of the phenotype. By the time an individual can be identified
as having low bone mass, there is a high probability that their parents or other members of
prior generations will be deceased and therefore unavailable for study, and younger
generations may not have even reached peak bone mass, making their phenotyping uncertain
for genetic analysis.

Regardless, linkage analysis can be used to find the location of a gene causing a
hereditary "disorder" and does not require any knowledge of the biochemical nature of the
disorder, i.e., a mutated protein that is believed to cause the disorder does not need to be
known. Traditional approaches depend on assumptions concerning the disease process that
might implicate a known protein as a candidate to be evaluated. The genetic localization
approach using linkage analysis can be used to first find the general chromosomal region in
which the defective gene is located and then to gradually reduce the size of the region in
order to determine the location of the specific mutated gene as precisely as possible. After
the gene itself is discovered within the candidate region, the messenger RNA and the protein
are identified and, along with the DNA, are checked for mutations.

The genetic localization approach has practical implications since the location of the
disease gene can be used for prenatal diagnosis even before the altered gene that causes the
disease is found. Linkage analysis can enable families, even many of those that do not have a sick
child, to know whether they are carriers of a disease gene and to evaluate the condition of an
unborn child through molecular diagnosis. The transmission of a disease within families,
then, can be used to find the defective gene. As used herein, reference to "high bone mass"
(HBM) is analogous to reference to a disease state, although from a practical standpoint high
bone mass can actually help a subject avoid the disease known as osteoporosis.
Linkage analysis is possible because of the nature of inheritance of chromosomes
from parents to offspring. During meiosis, the two parental homologues pair to guide their
proper separation to daughter cells. While they are lined up and paired, the two homologues
exchange pieces of the chromosomes, in an event called "crossing over" or "recombination."
The resulting chromosomes are chimeric, that is, they contain parts that originate from both
parental homologues. The closer together two sequences are on the chromosome, the less
likely that a recombination event will occur between them, and the more closely linked they
are. In a linkage analysis experiment, two positions on the chromosomes are followed from
one generation to the next to determine the frequency of recombination between them. In a
study of an inherited disease, one of the chromosomal positions is marked by the disease gene
or its normal counterpart, i.e., the inheritance of the chromosomal region can be determined
by examining whether the individual displays symptoms of the disorder or not. The other
position is marked by a DNA sequence that shows natural variation in the population such
that the two homologues can be distinguished based on the copy of the "marker" sequence
that they possess. In every family, the inheritance of the genetic marker sequence is
compared to the inheritance of the disease state. If, within a family carrying an autosomal
dominant disorder such as high bone mass, every affected individual carries the same form of
the marker and all the unaffected individuals carry at least one different form of the marker,
there is a great probability that the disease gene and the marker are located close to each
other. In this way, chromosomes may be systematically checked with known markers and
compared to the disease state. The data obtained from the different families are combined and
analyzed together by a computer using statistical methods. The result is information
indicating the probability of linkage between the genetic marker and the disease, allowing for
different distances between them. A positive result can mean that the disease gene is very close to
the marker, while a negative result indicates that it is far away on that chromosome, or on an
entirely different chromosome.

Linkage analysis is performed by typing all members of the affected family at a given
marker locus and evaluating the co-inheritance of a particular disease state with the marker
probe, thereby determining how often the two of them are co-inherited. The recombination
frequency can be used as a measure of the genetic distance between two gene loci. A
recombination frequency of 1% is equivalent to 1 map unit, or 1 centiMorgan (cM), which is
roughly equivalent to 1,000 kb of DNA. This relationship holds up to frequencies of about
20% or 20 cM.
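
By way of illustration only, the rule of thumb just stated can be sketched in Python. The function names and the constants KB_PER_CM and MAX_LINEAR_PERCENT are hypothetical labels chosen for this sketch; the figures of 1,000 kb per cM and the 20% limit are taken from the paragraph above and are approximations, not exact physical distances.

```python
# Rough conversion between recombination frequency, genetic distance (cM),
# and physical distance (kb), using the approximations quoted in the text.

KB_PER_CM = 1_000           # ~1,000 kb of DNA per centiMorgan (rough figure)
MAX_LINEAR_PERCENT = 20.0   # the 1% = 1 cM relationship holds up to ~20%


def recombination_percent_to_cm(recombination_percent: float) -> float:
    """Convert a recombination frequency in percent to map distance in cM.

    Raises ValueError outside the range where the linear rule is stated to hold.
    """
    if not 0.0 <= recombination_percent <= MAX_LINEAR_PERCENT:
        raise ValueError("linear approximation only holds from 0% to about 20%")
    return recombination_percent  # 1% recombination corresponds to 1 cM


def cm_to_kb(distance_cm: float) -> float:
    """Approximate physical distance in kb for a genetic distance in cM."""
    return distance_cm * KB_PER_CM


if __name__ == "__main__":
    cm = recombination_percent_to_cm(5.0)
    print(f"5% recombination ~ {cm} cM ~ {cm_to_kb(cm):,.0f} kb")
```

The explicit range check reflects the statement that the linear relationship is only reliable up to about 20 cM; beyond that, a mapping function would be needed, which this sketch does not attempt.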

The entire human genome is approximately 3,300 cM long. In order to find an unknown disease
gene within 5-10 cM of a marker locus, the whole human genome can be searched with
roughly 330 informative marker loci spaced at approximately 10 cM intervals (Botstein et al.,
Am. J. Hum. Genet., 32:314-331 (1980)). The reliability of linkage results is established by
using a number of statistical methods. The method most commonly used for the analysis of
linkage in humans is the LOD score method (Morton, Prog. Clin. Biol. Res., 147:245-265
(1984); Morton et al., Am. J. Hum. Genet., 38:868-883 (1986)), which was incorporated into
the computer program, LIPED, by Ott, Am. J. Hum. Genet., 28:528-529 (1976). LOD scores
are the logarithm of the ratio of the likelihood that two loci are linked at a given distance to
the likelihood that they are not linked (>50 cM apart). The advantage of using logarithmic
values is that they can be summed among families with the same disease. This becomes
necessary given the relatively small size of human families.
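
The LOD score definition above can be illustrated with a minimal Python sketch. It assumes the simplest textbook case, in which every meiosis is fully informative and phase-known, so that the likelihood at recombination fraction theta is theta^r (1 - theta)^(n - r) for r recombinants in n meioses. This is not the LIPED algorithm, which handles general pedigrees; the function names are hypothetical.

```python
import math


def lod_score(recombinants: int, meioses: int, theta: float) -> float:
    """Two-point LOD score for phase-known, fully informative meioses.

    LOD = log10( L(theta) / L(0.5) ), where
    L(theta) = theta**r * (1 - theta)**(n - r).
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must lie between 0 and meioses")
    non_recombinants = meioses - recombinants
    if theta == 0.0:
        # Any observed recombinant is impossible under complete linkage.
        if recombinants > 0:
            return float("-inf")
        log_linked = 0.0
    else:
        log_linked = (recombinants * math.log10(theta)
                      + non_recombinants * math.log10(1.0 - theta))
    log_unlinked = meioses * math.log10(0.5)
    return log_linked - log_unlinked


def total_lod(families, theta: float) -> float:
    """Sum LOD scores over families given as (recombinants, meioses) pairs."""
    return sum(lod_score(r, n, theta) for r, n in families)


if __name__ == "__main__":
    # Two small hypothetical families evaluated at theta = 0.1.
    print(round(total_lod([(0, 5), (1, 6)], 0.1), 3))
```

Because the scores are logarithms, summing them over families is equivalent to multiplying the underlying likelihood ratios, which is why small families can be pooled.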

By convention, a total LOD score greater than +3.0 (that is, odds of linkage at the
specified recombination frequency being 1000 times greater than odds of no linkage) is
considered to be significant evidence for linkage at that particular recombination frequency.
A total LOD score of less than -2.0 (that is, odds of no linkage being 100 times greater than
odds of linkage at the specified frequency) is considered to be strong evidence that the two
loci under consideration are not linked at that particular recombination frequency. Until
recently, most linkage analyses have been performed on the basis of two-point data, which is
the relationship between the disorder under consideration and a particular genetic marker.
However, as a result of the rapid advances in mapping the human genome over the last few
years, and concomitant improvements in computer methodology, it has become feasible to
carry out linkage analyses using multi-point data. Multi-point analysis provides a
simultaneous analysis of linkage between the disease and several linked genetic markers,
when the recombination distance among the markers is known.
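
The conventional thresholds described above (+3.0 and -2.0) can be expressed as a short Python sketch. The function names and the returned labels are hypothetical and chosen only for illustration; the thresholds themselves are those stated in the preceding paragraph.

```python
def odds_from_lod(lod: float) -> float:
    """Convert a LOD score back to an odds ratio (linkage : no linkage)."""
    return 10.0 ** lod


def interpret_lod(total_lod: float) -> str:
    """Classify a total LOD score using the conventional thresholds."""
    if total_lod > 3.0:
        return "significant evidence for linkage"
    if total_lod < -2.0:
        return "strong evidence against linkage"
    return "inconclusive"


if __name__ == "__main__":
    for score in (3.5, 1.2, -2.4):
        print(score, interpret_lod(score), f"odds ~ {odds_from_lod(score):.3g}")
```

A score between the two thresholds is treated here as inconclusive, meaning that more informative meioses or additional families would be needed at that recombination frequency.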

Multi-point analysis is advantageous for two reasons. First, the informativeness of the
pedigree is usually increased. Each pedigree has a certain amount of potential information,
dependent on the number of parents heterozygous for the marker loci and the number of
affected individuals in the family. However, few markers are sufficiently polymorphic as to
be informative in all those individuals. If multiple markers are considered simultaneously,
then the probability of an individual being heterozygous for at least one of the markers is
greatly increased. Second, an indication of the position of the disease gene among the
markers may be determined. This allows identification of flanking markers, and thus
eventually allows isolation of a small region in which the disease gene resides. Lathrop et
al., Proc. Natl. Acad. Sci. USA, 81:3443-3446 (1984) have written the most widely used
computer package, LINKAGE, for multi-point analysis.
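
The first advantage described above, that considering several markers together raises the chance that an individual is heterozygous for at least one of them, can be illustrated numerically. The Python sketch below assumes, purely for simplicity, that heterozygosity at each marker is independent of the others; for closely linked markers this is only an approximation. The function name and the example heterozygosity values are hypothetical.

```python
import math


def prob_heterozygous_for_any(heterozygosities) -> float:
    """Probability of being heterozygous at one or more markers.

    Each entry is the per-marker probability of heterozygosity; markers are
    treated as independent (a simplifying assumption).
    """
    values = list(heterozygosities)
    for h in values:
        if not 0.0 <= h <= 1.0:
            raise ValueError("each heterozygosity must lie between 0 and 1")
    # Complement of the probability of being homozygous at every marker.
    return 1.0 - math.prod(1.0 - h for h in values)


if __name__ == "__main__":
    single = prob_heterozygous_for_any([0.6])
    triple = prob_heterozygous_for_any([0.6, 0.6, 0.6])
    print(f"one marker: {single:.3f}, three markers: {triple:.3f}")
```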

There is a need in the art for identifying the gene associated with a high bone mass
phenotype. There is also a need for tools for the study of the high bone mass gene and
phenotype. More generally, there is a need for the development of diagnostic tools and
treatments. The present invention is directed to these, as well as other, important ends.

SUMMARY OF THE INVENTION

It is an object of the invention to provide a nucleic acid comprising a mutation in
LRP5 or LRP6 which results in a HBM phenotype when expressed in a cell, wherein said
HBM phenotype results in bone mass modulation and/or lipid level modulation. Another
embodiment contemplates that the mutation is located in propeller 1. In another embodiment,
the nucleic acid encodes a mutation comprising at least one mutation of Tables 2 or 3 or
which results in a mutation of one of the following: G171V, A214V, A65V, M282V, G171K,
G171F, G171I, G171Q, L200V, T201V, I202V, or S127V when expressed in a cell, and
wherein expression of the nucleic acid in a subject results in bone mass modulation and/or
lipid level modulation. In the instance of LRP6, the mutation is located in a position
equivalent to LRP5 such that expression of the nucleic acid in a subject results in bone mass
modulation and/or lipid level modulation. In another embodiment, the preferred mutation of
LRP5 is G171V, A214V, A65V, M282V, G171K, G171F, G171I or G171Q or in an
equivalent location if dealing with LRP6.

Another embodiment contemplated herein is a polypeptide encoded by any of the
above nucleic acids, wherein said polypeptide when expressed in a cell modulates Wnt
signaling, LRP5 activity and/or LRP6 activity. The polypeptide can additionally or
alternatively modulate bone mass and/or lipid levels when expressed in a subject. These
polypeptides or biologically active fragments thereof may preferably contain any of the
following mutations of Table 2, G171V, A214V, A65V, M282V, G171K, G171F, G171I,
G171Q, L200V, T201V, I202V, or S127V in LRP5 or in an equivalent location in LRP6. The
most preferred mutations are G171V, A214V, A65V, M282V, G171K, G171F, G171I or
G171Q in LRP5 or an equivalent location in LRP6.

Yet another embodiment contemplates antibodies and immunogenic fragments thereof
which bind to these proteins. The contemplated antibodies include a monoclonal antibody, a
chimeric antibody, a bispecific antibody, a humanized antibody, a primatized® antibody, a
human antibody, or a labeled antibody. Preferably, some of the antibodies can discriminate
between the wild type and variant forms of LRP5 and LRP6.

Another embodiment contemplates antibodies which bind to polypeptides comprising:
208 KLYWADAKLSFIHRAN 223, 277 ALYSPMDIQVLSQER 291,
61 GLEDAAAVDFQFSKGA 73, 234 EGSLTHPFALTLSG 247, 249 TLYWTDWQTRSIHACN 264,
144 VLFWQDLDQPRAI 156, 194 IYWPNGLTIDLEEQKLY 210, 34 LLLFANRRDVRLVD 47,
75 GAVYWTDVSEEAIKQ 89, 121 KLYWTDSETNRIEVA 135 of LRP5 or an equivalent domain
on LRP6 or variants thereof.

The above antibodies can also be used in a composition for modulating bone mass
and/or lipid levels in a subject comprising a therapeutically effective amount of the antibody
or immunogenic fragment and a pharmaceutically acceptable carrier.

The invention further contemplates a method of diagnosing a HBM-like phenotype in
a subject comprising: (A) obtaining a biological sample from said subject; (B) exposing the
sample to one of the described antibodies or immunogenic fragments; and (C) detecting
whether the antibody bound a protein from the biological sample from said subject to
determine whether the subject has a HBM-like phenotype.

Another embodiment contemplates a transgenic animal having somatic and/or germ
cells comprising a nucleic acid which comprises a promoter region that directs protein
expression in animal and/or human cells operably linked to any of the herein described
nucleic acids, and wherein said transgenic animal has at least three bone parameters
modulated by the expression of said nucleic acid. The promoter region can be selected from
the group consisting of CMV, RSV, SV40, EF-1α, CMVβActin, histone, type I
collagen, TGFβ1, SX2, cfos/cjun, Cbfa1, Fra/Jun, Dlx5, osteocalcin, osteopontin, bone
sialoprotein, and collagenase promoter regions.

A further embodiment of the invention contemplates an animal model for the study of
bone density modulation and/or lipid level modulation comprising a first group of animals
composed of any of the described transgenic animals and a second group of control animals.

Another embodiment provides for a method of identifying agents which modulate the
activity of an HBM-like nucleic acid comprising: (A) transfecting a cell with a vector of
claim 11; (B) exposing the transfected cell of step (A) to a compound; and (C) determining
whether the compound modulates the activity of the HBM-like nucleic acid. Such agents can
include a hormone, a growth factor, a peptide, RNA, siRNA, DNA, a mineral, a vitamin, a
natural product, or a synthetic organic compound.

Another aspect of the invention provides for a method for identifying compounds
which modulate the interaction of Dkk with the Wnt signaling pathway comprising: (A)
transfecting cells with constructs containing any of the described nucleic acids; (B) assessing
changes in expression of a reporter element linked to a Wnt-responsive promoter; and (C)
identifying as a Dkk/Wnt interaction modulating compound any compound which alters
reporter gene expression compared with cells transfected with a Dkk construct alone. The
cells are preferably cancer cells, liver cells or bone cells. The reporter element used is
TCF-luciferase, tk-Renilla, or a combination thereof.

Yet another embodiment includes a method of diagnosing a subject as expressing a
nucleic acid comprising a nucleotide change of Tables 2 or 3 or any other mutations, the
method comprising the steps of: (A) obtaining a biological sample from the subject; and (B)
assaying for the presence of the nucleotide change which results in HBM phenotype.

The invention also provides agents identified by the above methods which regulate 
Wnt activity, Dkk activity, bone mass and/or lipid levels. 

BRIEF DESCRIPTION OF THE FIGURES
Fig. 1 shows the pedigree of the individuals used in the genetic linkage studies.
Under each individual is an ID number, the z-score for spinal BMD, and the allele calls for
the critical markers on chromosome 11. Solid symbols represent "affected" individuals.
Symbols containing "N" are "unaffected" individuals. DNA from 37 individuals was
genotyped. Question marks denote unknown genotypes or individuals who were not
genotyped.

Fig. 2 depicts the BAC/STS content physical map of the HBM region in 11q13.3.
STS markers derived from genes, ESTs, microsatellites, random sequences, and BAC end
sequences are denoted above the long horizontal line. For markers that are present in GDB
the same nomenclature has been used. Locus names (D11S####) are listed in parentheses
after the primary name if available. STSs derived from BAC end sequences are listed with
the BAC name first followed by L or R for the left and right end of the clone, respectively.
The two large arrows indicate the genetic markers that define the HBM critical region. The
horizontal lines below the STSs indicate BAC clones identified by PCR-based screening of a
nine-fold coverage BAC library. Open circles indicate that the marker did not amplify the
corresponding BAC library address during library screening. Clone names use the following
convention: B for BAC, the plate, row and column address, followed by -H indicating the
HBM project (e.g., B36F16-H).

Figs. 3A-3F show the genomic structure of Zmax1 (LRP5) with flanking intron
sequences. Translation is initiated by the underlined "ATG" in exon 1. The site of the
polymorphism in the HBM gene is in exon 3 and is represented by the underlined "G,"
whereby this nucleotide is a "T" in the HBM gene. The 3' untranslated region of the mRNA
is underlined within exon 23 (exon 1, SEQ ID NO:40; exon 2, SEQ ID NO:41; exon 3, SEQ
ID NO:42; exon 4, SEQ ID NO:43; exon 5, SEQ ID NO:44; exon 6, SEQ ID NO:45; exon 7,
SEQ ID NO:46; exon 8, SEQ ID NO:47; exon 9, SEQ ID NO:48; exon 10, SEQ ID NO:49;
exon 11, SEQ ID NO:50; exon 12, SEQ ID NO:51; exon 13, SEQ ID NO:52; exon 14, SEQ
ID NO:53; exon 15, SEQ ID NO:54; exon 16, SEQ ID NO:55; exon 17, SEQ ID NO:56;
exon 18, SEQ ID NO:57; exon 19, SEQ ID NO:58; exon 20, SEQ ID NO:59; exon 21, SEQ
ID NO:60; exon 22, SEQ ID NO:61; and exon 23, SEQ ID NO:62).

Fig. 4 shows the domain organization of Zmax1 (LRP5), including the YWTD
spacers, the extracellular attachment site, the binding site for LDL and calcium, the
cysteine-rich growth factor repeats, the transmembrane region, the ideal PEST region with the CK-II
phosphorylation site and the internalization domain. Fig. 4 also shows the site of the glycine
to valine change that occurs in the HBM protein. The signal peptide is located at amino acids
1-31, the extracellular domain is located at amino acids 32-1385, the transmembrane segment
is located at amino acids 1386-1413, and the cytoplasmic domain is located at amino acids
1414-1615.
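
For illustration, the amino acid boundaries recited for Fig. 4 can be captured in a small Python lookup. The table name LRP5_DOMAINS and the function domain_of are hypothetical; only the coordinate ranges come from the description above.

```python
# Domain boundaries of Zmax1 (LRP5) as recited in the description of Fig. 4.
# Each entry: (domain name, first residue, last residue), 1-based inclusive.
LRP5_DOMAINS = [
    ("signal peptide", 1, 31),
    ("extracellular domain", 32, 1385),
    ("transmembrane segment", 1386, 1413),
    ("cytoplasmic domain", 1414, 1615),
]


def domain_of(residue: int) -> str:
    """Return the name of the domain containing a 1-based residue number."""
    for name, start, end in LRP5_DOMAINS:
        if start <= residue <= end:
            return name
    raise ValueError(f"residue {residue} is outside positions 1-1615")


if __name__ == "__main__":
    # Position 171 is the site of the glycine to valine change.
    print(171, "->", domain_of(171))
```

Under these boundaries, position 171 falls within the extracellular domain.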

Fig. 5 is a schematic illustration of the BAC contigs B527D12 and B200E21 in 
relation to the HBM gene. 

Figs. 6A-6J are the nucleotide (SEQ ID NO:1) and amino acid (SEQ ID NO:3)
sequences of the wild-type gene, Zmax1 (LRP5). The location for the base pair substitution at
nucleotide 582, a guanine to thymine, (SEQ ID NO:2 and 4) is underlined. This allelic
variant is the HBM gene. The HBM gene encodes a protein with an amino acid
substitution of glycine to valine at position 171. The 5' untranslated region (UTR) spans
bases 1 to 70, and the 3' UTR spans bases 4916 to 5120.
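
As a consistency check, the relationship between nucleotide 582 and amino acid position 171 can be worked through in a short Python sketch. It assumes that the coding sequence begins immediately after the 5' UTR, i.e., at base 71, which is an inference from the boundaries recited above rather than an explicit statement. The function names are hypothetical, and the codon sets are the standard genetic code entries for glycine and valine.

```python
# Standard genetic code entries for the two amino acids involved.
GLYCINE_CODONS = {"GGT", "GGC", "GGA", "GGG"}
VALINE_CODONS = {"GTT", "GTC", "GTA", "GTG"}


def locate_codon(nucleotide_pos: int, utr5_length: int):
    """Map a 1-based cDNA position to (codon number, position within codon).

    Assumes the coding sequence starts right after the 5' UTR.
    """
    cds_pos = nucleotide_pos - utr5_length
    if cds_pos < 1:
        raise ValueError("position lies within the 5' UTR")
    codon_number = (cds_pos - 1) // 3 + 1
    position_in_codon = (cds_pos - 1) % 3 + 1
    return codon_number, position_in_codon


def substitute(codon: str, position: int, base: str) -> str:
    """Return the codon with the base at a 1-based position replaced."""
    return codon[:position - 1] + base + codon[position:]


if __name__ == "__main__":
    codon, pos = locate_codon(582, 70)
    print(f"nucleotide 582 -> codon {codon}, position {pos}")
    # A G-to-T change at the second position turns any glycine codon
    # into a valine codon.
    print(sorted(substitute(c, pos, "T") for c in GLYCINE_CODONS))
```

Under the stated assumption, nucleotide 582 is the second base of codon 171, and a guanine to thymine change at the second position of any glycine codon (GGN) yields a valine codon (GTN), which agrees with the glycine to valine substitution at position 171.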

Figs. 7A and 7B are northern blot analyses showing the expression of Zmax1 (LRP5)
in various tissues.

Fig. 8 is a PCR product analysis. 

Fig. 9 is allele specific oligonucleotide detection of the Zmax1 (LRP5) exon 3
mutation.

Fig. 10 is the cellular localization of mouse Zmax1 (LRP5) by in situ hybridization at
100X magnification using sense and antisense probes.

Fig. 11 is the cellular localization of mouse Zmax1 (LRP5) by in situ hybridization at
400X magnification using sense and antisense probes.

Fig. 12 is the cellular localization of mouse Zmax1 (LRP5) by in situ hybridization of
osteoblasts in the endosteum at 400X magnification using sense and antisense probes.

Fig. 13 shows antisense inhibition of Zmax1 (LRP5) expression in MC-3T3 cells.

Fig. 14 shows a Zmax1 (LRP5) exon 3 allele specific oligonucleotide (ASO) assay
which illustrates the rarity of the HBM allele (right panels; T-specific oligo; 58 °C wash) as
compared to the wild-type Zmax1 (LRP5) allele (left panels; G-specific oligo; 55 °C wash).
The positive spots appearing in the right panels were positive controls.

Fig. 15 depicts a model representing the potential role of Zmax1 (LRP5) in focal
adhesion signaling.

Fig. 16 depicts a schematic of two Zmax1 (LRP5) gene targeting vectors for the
knock-out of endogenous mouse Zmax or conditional knock-in of the HBM polymorphism.
B, X, and R indicate BamHI, XbaI, and EcoRI sites in DNA BAC 473 5P5, respectively.
Exons 3, 4, and 5 are indicated by black rectangles. A G -> T base change is engineered at
base 24 of exon 3 to produce the HBM polymorphism. The location of a LoxP flanked
cassette containing a neomycin resistance gene and a synthetic pause sequence and probes
used for screening and characterizing of ES cell clones are also indicated.

Fig. 17 confirms expression by the transgenic (i.e., HBMMCBA and HBMMTIC) and
wild-type (i.e., ZmaxWTCBA and ZmaxWTTIC) plasmid constructs. These constructs were
transiently transfected into HOB-02-02 cells and the mRNA levels determined using
TaqMan® quantitative PCR. HBMMCBA and ZmaxWTCBA are shown in the left column
(i.e., CMVβActin) and HBMMTIC and ZmaxWTTIC are shown in the right column (i.e.,
Type I collagen) of the Table.

Fig. 18 depicts a comparison between the human and mouse TaqMan® Primer/Probe 
sets. HOB (HOB-03-C5) and mouse (MC-3T3-E1) osteoblastic cell mRNA was analyzed 
using the probes and primers. 

Fig. 19 depicts the quantification of human Zmax1 (LRP5) mRNA expressed in a
mixed human and mouse RNA background using the TaqMan® Primer/Probe sets. Results
are presented as human Zmax1 (LRP5) mRNA added (ng) versus human Zmax1 (LRP5)
mRNA measured (ng).

Fig. 20 depicts expression of HBM in transgenic mice based on mRNA expression 
analyzed by TaqMan®. 

Fig. 21A-F. Panels A-C depict the analysis of various transgenic mouse lines which
express the HBMMCBA construct in spine (Fig. 21A), femur (Fig. 21B) and total body (Fig.
21C). Panels D-F depict the analysis of various transgenic mouse lines which express the 
HBMMTIC construct in spine (Fig. 21D), femur (Fig. 21E) and total body (Fig. 21F). 

Fig. 22 depicts changes in BMD in HBM transgenic mice (i.e., HBMMCBA and
HBMMTIC constructs) at 5 weeks using in vivo pDXA* analysis. The BMD changes are
presented as compared to wild-type animals which were also only 5 weeks old.

Fig. 23 depicts changes in BMD in HBM transgenic mice (i.e., HBMMCBA and
HBMMTIC constructs) at 9 weeks using in vivo pDXA* analysis. The BMD changes are
presented as compared to wild-type animals which were also only 9 weeks old.

Fig. 24 (A-D) presents the sequence of the insert of the gene, HBMGI_2AS,
subcloned into the vector (SEQ ID NO:759).

Fig. 25 (A-D) presents the sequence of the insert of the gene, ZMAXGI3AS, 
subcloned into the vector (SEQ ID NO:760).

Fig. 26 (A-C) presents an alignment of human (SEQ ID NO:761) and mouse (SEQ
ID NO:762) Zmax1 (LRP5) amino acid sequences.

Fig. 27 (A-C) presents an alignment of human LRP5 (Zmax1) (SEQ ID NO:763) and
LRP6 (SEQ ID NO:764) amino acid sequences. 

Fig. 28 shows a schematic of the components of the Wnt signal transduction pathway. 
Schematic obtained from: http://www.stanford.edW

Fig. 29 shows that the mutation G171F in LRP5 produces a greater activation of the
Wnt pathway than LRP5, which is consistent with HBM activity.

Fig. 30 shows that the mutation M282V in LRP5 produces an activation of the Wnt
pathway which is consistent with HBM activity in U2OS cells.

Fig. 31 shows a table of proteins identified in a Y2H screen using a Dkk-1 bait
sequence. These proteins are identified by both their nucleic acid and amino acid accession
numbers.

Fig. 32 shows the differential binding of an antibody generated to a sequence (a.a.
165-177) containing the HBM mutation in LRP5 in LRP5 virus-infected and HBM virus-infected cells.

Fig. 33 shows a diagram of the Xenopus Embryo Assay for Wnt activity.

Fig. 34 shows the effects of Zmax/LRP5 and HBM on Wnt signaling in the Xenopus
embryo assay.

Fig. 35 shows the effects of Zmax/LRP5 and HBM on induction of secondary axis 
formation in the Xenopus embryo assay. 

Fig. 36 shows the effects of human Dkk-1 on the repression of the canonical Wnt
pathway.

Fig. 37 shows the effects of human Dkk-1 on Zmax/LRP5 and HBM-mediated Wnt
signaling.

Fig. 38 shows a table of peptide aptamer insert sequences identified in a Y2H screen
using a LRP5 ligand binding domain bait sequence.

Fig. 39 shows pcDNA3.1 construct names with nucleotide sequences for LRP5-binding
peptide aptamers, Dkk-1 peptides and control constructs.

Fig. 40 shows the results of a minimum interaction domain mapping screen of Dkk-1 
with LRP5. At the top, a map of Dkk-1 showing the location of the signal sequence, and 
cysteine rich domains 1 and 2. Below, the extent of domains examined using LRP5 LBD 
baits, LBD1 and LBD4. To the right, scoring of the binding results observed in the
experiment. 

Fig. 41 shows the effects of Dkk-1 and Dkk-2 on Wnt1 signaling with coreceptors
LRP5, HBM, and LRP6 in HOB03CE6 cells.

Fig. 42 shows the effects of Dkk-1 and Dkk-2 on Wnt3a signaling with coreceptors
LRP5, HBM, and LRP6 in HOB03CE6 cells.

Fig. 43 demonstrates that the LRP5-LBD peptide aptamer 262 activates Wnt signaling
in the presence of Wnt3a in U2OS cells.

Fig. 44 shows the amino acid sequences for the corresponding LRP5-binding
peptides, Dkk-1 peptide aptamers and control constructs in Figure 39.

Fig. 45 shows that Dkk-1 represses Wnt3a-mediated Wnt signaling in U2OS bone
cells using the cell-based reporter gene assay for high throughput screening.

Fig. 46 demonstrates that Wnt1-HBM generated signaling is not efficiently inhibited
by Dkk-1 in U2OS bone cells, while LRP5- and LRP6-mediated signaling are, using the cell-based
reporter gene assay for high throughput screening.

Fig. 47 shows that the TCF signal in the cell-based reporter gene assay for high
throughput screening can be modulated by Dkk-1 and Dkk-1-AP without Wnt DNA
transfection.

Fig. 48 shows the morphological results in the Xenopus assay using aptamers 261 and
262 from the LRP5-LBD to activate Wnt signaling.

Fig. 49 demonstrates that LRP5-LBD aptamers 261 and 262 induce Wnt signaling
over other LRP5 aptamers.

Fig. 50 depicts the LRP5 domain structure. The symbols are defined at the upper 
right. The structural domains are numbered successively from top to bottom corresponding 
to the N-terminal to C-terminal ends of the protein as Propeller 1, EGF-like Domain 1, 
Propeller 2, EGF-like Domain 2, and so forth. The structure was determined using the motifs 
and nomenclature described in Springer et al., 1998, J. Mol. Biol. 283: 837-62.

Fig. 51 is a CLUSTALW (1.8) multiple sequence alignment of the β-propeller segments
of LRP5 from human (af077820h) and mouse (af064984m and af077847m), LRP6 from
human (af074264h) and mouse (af074265m), and the sequences from proteins with modeled
6-bladed β-propellers: chicken LRP1 (1lpx) and human nidogen (1ndx). For the LRPs, the
final suffix letter designates which of the four propeller domains the sequence comes from
(a => prop. #1, b => prop. #2, etc.). The final four lines give the secondary structure
assignments predicted by DSC for all sequences given in this alignment (H = helix, E =
strand and C = coil) and the weights assigned to each structural type at each position on a
scale of 0 (least probable) to 9 (most probable). Sequences corresponding to Springer's
models' strand positions are underlined and color-coded according to which of the six
blades they belong to. In the propeller geometry, alternate loops fall on opposite faces of
the disc-shaped domain. The loops on the "top" face (i.e., opposite the points of entry and
exit of the chain from the structure) are colored red. The position of the G171V mutation
is marked with "*".

Fig. 52 displays the functional effect of mutations on side chain interactions in HBM
protein as compared to the wild-type Zmax1 (LRP5) protein. It also shows the side chain
interactions of G171F, an HBM-like variant.

Fig. 53 displays the structural change due to the OPPG mutation versus the wild-type,
Zmax1 (LRP5). Panel A depicts the homology model of LRP5's second propeller domain;
Panel B depicts the third propeller domain and Panel C depicts the second propeller domain
with two mutations.

DETAILED DESCRIPTION OF THE INVENTION

1. Definitions 

To aid in the understanding of the specification and claims, the following definitions 
are provided.

"Gene" refers to a DNA sequence that encodes through its template or messenger
RNA a sequence of amino acids characteristic of a specific peptide. The term "gene"
includes intervening, non-coding regions, as well as regulatory regions, and can include 5'
and 3' ends.

By "nucleic acid" is meant to include single stranded and double stranded nucleic
acids including, but not limited to, DNAs, RNAs (e.g., mRNA, tRNAs, siRNAs), cDNAs,
recombinant DNA (rDNA), rRNAs, antisense nucleic acids, oligonucleotides, oligomers,
and polynucleotides. The term may also include hybrids such as triple stranded regions of
RNA and/or DNA or double stranded RNA:DNA hybrids. The term also is contemplated to
include modified nucleic acids such as, but not limited to, biotinylated nucleic acids, tritylated
nucleic acids, fluorophore labeled nucleic acids, inosine, and the like.

"Gene sequence" refers to a nucleic acid molecule, including DNA which contains a 
non-transcribed or non-translated sequence, which comprises a gene. The term is also 
intended to include any combination of gene(s), gene fragment(s), non-transcribed 
sequence(s) or non-translated sequence(s) which are present on the same DNA molecule. 

The nucleic acid sequences of the present invention may be derived from a variety of
sources including DNA, cDNA, synthetic DNA, synthetic RNA or combinations thereof.
Such sequences may comprise genomic DNA which may or may not include naturally
occurring introns. Moreover, such genomic DNA may be obtained in association with
promoter regions and/or poly (A) sequences. The sequences, genomic DNA or cDNA may
be obtained in any of several ways. Genomic DNA can be extracted and purified from
suitable cells by means well known in the art. Alternatively, mRNA can be isolated from a
cell and used to produce cDNA by reverse transcription or other means.

"cDNA" refers to complementary or copy DNA produced from an RNA template by 
the action of RNA-dependent DNA polymerase (reverse transcriptase). Thus, a "cDNA
clone" means a duplex DNA sequence for which one strand is complementary to an RNA
molecule of interest, carried in a cloning vector or PCR amplified. cDNA can also be single
stranded after first strand synthesis by reverse transcriptase. In this form, it is a useful PCR 
template and does not need to be carried in a cloning vector. This term includes genes from 
which the intervening sequences have been removed. Thus, the term "gene", as sometimes 
used generically, can also include nucleic acid molecules comprising cDNA and cDNA 
clones. 

"Recombinant DNA" means a molecule that has been engineered by splicing in vitro a 
cDNA or genomic DNA sequence or altering a sequence by methods such as PCR 
mutagenesis.

"Cloning" refers to the use of in vitro recombination techniques to insert a particular
gene or other DNA sequence into a vector molecule. In order to successfully clone a desired 
gene, it is necessary to use methods for generating DNA fragments, for joining the fragments 
to vector molecules, for introducing the composite DNA molecule into a host cell in which it 
can replicate, and for selecting the clone having the target gene from amongst the recipient 
host cells. 

"cDNA library" refers to a collection of recombinant DNA molecules containing 
cDNA inserts which together comprise the entire or a partial repertoire of genes expressed in 
a particular tissue or cell source. Such a cDNA library can be prepared by methods known to 
one skilled in the art and described by, for example, Cowell and Austin, "cDNA Library 
Protocols," Methods in Molecular Biology (1997). 

"Cloning vehicle" refers to a plasmid or phage DNA or other DNA sequence which is 
able to replicate in a host cell. This term can also include artificial chromosomes such as 
BACs and YACs. The cloning vehicle is characterized by one or more endonuclease 
recognition sites at which such DNA sequences may be cut in a determinable fashion without 
loss of an essential biological function of the DNA, which may contain a marker suitable for 
use in the identification of transformed cells. 

"Expression" refers to the process comprising transcription of a gene sequence and 
subsequent processing steps, such as translation of a resultant mRNA to produce the final end 
product of a gene. The end product may be a protein (such as an enzyme or receptor) or a 
nucleic acid (such as a tRNA, antisense RNA, or other regulatory factor). The term
"expression control sequence" refers to a sequence of nucleotides that controls or regulates
expression of structural genes when operably linked to those genes. These include, for
example, the lac system, the trp system, major operator and promoter regions of the phage
lambda, the control region of fd coat protein and other sequences known to control the
expression of genes in prokaryotic or eukaryotic cells. Expression control sequences will
vary depending on whether the vector is designed to express the operably linked gene in a 
prokaryotic or eukaryotic host, and may contain transcriptional elements such as enhancer 
elements, termination sequences, tissue-specificity elements and/or translational initiation and 
termination sites.

"Expression vehicle" refers to a vehicle or vector similar to a cloning vehicle but
which is capable of expressing a gene which has been cloned into it, after transformation into
a host. The cloned gene is usually placed under the control of (i.e., operably linked to) an 
expression control sequence. 

"Operator" refers to a DNA sequence capable of interacting with the specific 
repressor, thereby controlling the transcription of adjacent gene(s).

"Promoter" refers to a DNA sequence that can be recognized by an RNA polymerase. 
The presence of such a sequence permits the RNA polymerase to bind and initiate 
transcription of operably linked gene sequences. 

"Promoter region" is intended to include the promoter as well as other gene sequences 
which may be necessary for the initiation of transcription. The presence of a promoter region
is sufficient to cause the expression of an operably linked gene sequence. The term
"promoter" is sometimes used in the art to generically indicate a promoter region. Many
different promoters are known in the art which direct expression of a gene in certain cell
types. Tissue-specific promoters can comprise nucleic acid sequences which cause a greater
(or decreased) level of expression in cells of a certain tissue type.

"Operably linked" means that the promoter controls the initiation of expression of the 
gene. A promoter is operably linked to a sequence of proximal DNA if upon introduction 
into a host cell the promoter determines the transcription of the proximal DNA sequence(s) 
into one or more species of RNA. A promoter is operably linked to a DNA sequence if the
promoter is capable of initiating transcription of that DNA sequence.

"Prokaryote" refers to all organisms without a true nucleus, including bacteria.

"Eukaryote" refers to organisms and cells that have a true nucleus, including
mammalian cells.

"Host" includes prokaryotes and eukaryotes, such as yeast and filamentous fungi, as 
well as plant and animal cells. The term includes an organism or cell that is the recipient of a
replicable expression vehicle. 

The term "animal" is used herein to include all vertebrate animals, except humans. It
also includes an individual animal in all stages of development, including embryonic and
fetal stages. Preferred animals include higher eukaryotes such as avians, rodents (e.g., mice,
rabbits, rats, chinchillas, guinea pigs, hamsters and the like), and mammals. Preferred
mammals include bovine, equine, feline, canine, ovine, caprine, porcine, buffalo, humans,
and primates. 

A "transgenic animal" is an animal containing one or more cells bearing genetic 
information received, directly or indirectly, by deliberate genetic manipulation or by
inheritance from a manipulated progenitor at a subcellular level, such as by microinjection or
infection with a recombinant viral vector (e.g., adenovirus, retrovirus, herpes virus, adeno- 
associated virus, lentivirus). This introduced DNA molecule may be integrated within a 
chromosome, or it may be extra-chromosomally replicating DNA.

"Embryonic stem cells" or "ES cells" as used herein are cells or cell lines usually
derived from embryos which are pluripotent, meaning that they are un-differentiated cells.
These cells are also capable of incorporating exogenous DNA by homologous recombination
and subsequently developing into any tissue in the body when incorporated into a host 
embryo. It is possible to isolate pluripotent cells from sources other than embryonic tissue by 
methods which are well understood in the art. 

Embryonic stem cells in mice have enabled researchers to select for transgenic cells
and perform gene targeting. This allows more genetic engineering than is possible with other
transgenic techniques. For example, mouse ES cells are relatively easy to grow as colonies in
vitro. The cells can be transfected by standard procedures and transgenic cells clonally
selected by antibiotic resistance. See, for example, Doetschman et al., 1994, Gene transfer
in embryonic stem cells. In Pinkert (Ed.) Transgenic Animal Technology: A Laboratory
Handbook. Academic Press Inc., New York, pp. 115-146. Furthermore, the efficiency of this
process is such that sufficient transgenic colonies (hundreds to thousands) can be produced to
allow a second selection for homologous recombinants. Mouse ES cells can then be
combined with a normal host embryo and, because they retain their potency, can develop into
all the tissues in the resulting chimeric animal, including the germ cells. The transgenic
modification can then be transmitted to subsequent generations. 

Methods for deriving embryonic stem (ES) cell lines in vitro from early 
preimplantation mouse embryos are well known. See, for example, Evans et al., 1981, Nature
292: 154-6 and Martin, 1981, Proc. Natl. Acad. Sci. USA, 78: 7634-8. ES cells can be passaged
in an undifferentiated state, provided that a feeder layer of fibroblast cells or a differentiation
inhibiting source is present. 

The term "somatic cell" indicates any animal or human cell which is not a sperm or
egg cell and is not capable of becoming a sperm or egg cell. The term "germ cell" or "germ-line
cell" refers to any cell which is either a sperm or egg cell or is capable of developing into a
sperm or egg cell and can therefore pass its genetic information to offspring. The term "germ
cell-line transgenic animal" refers to a transgenic animal in which the genetic information 
was incorporated in a germ line cell, thereby conferring the ability to transfer the information 
to offspring. If such offspring in fact possess some or all of that information, then they, too, 
are transgenic animals. 

The genetic alteration of genetic information may be foreign to the species of animal
to which the recipient belongs, or foreign only to the particular individual recipient. In the
last case, the altered or introduced gene may be expressed differently than the native gene. 

"Fragment" of a gene refers to any portion of a gene sequence. A "biologically active 
fragment" refers to any portion of the gene that retains at least one biological activity of that
gene. For example, the fragment can perhaps hybridize to its cognate sequence or is capable
of being translated into a polypeptide fragment encoded by the gene from which it is derived. 

"Variant" refers to a gene that is substantially similar in structure and biological 
activity or immunological characteristics to either the entire gene or to a fragment of the 
gene. Provided that the two genes possess a similar activity, they are considered variant as
that term is used herein even if the sequence of encoded amino acid residues is not identical.
Preferentially, as used herein (unless otherwise defined) the variant is one of LRP5, HBM or
LRP6. The variant preferably is one that yields an HBM-like phenotype (i.e., enhances bone
mass and/or modulates lipid levels). These variants include missense mutations, single
nucleotide polymorphisms (SNPs), mutations which result in changes in the amino acid 
sequence of the protein encoded by the gene or nucleic acid, and combinations thereof, as 
well as com in the exon domains of the HBM gene and mutations in LRP5 or LRP6 which 
result in an HBM like phenotype. 

"Amplification of nucleic acids" refers to methods such as polymerase chain reaction 
(PCR), ligation amplification (or ligase chain reaction, LCR) and amplification methods 
based on the use of Q-beta replicase. These methods are well known in the art and described, 
for example, in U.S. Patent Nos. 4,683,195 and 4,683,202. Reagents and hardware for 
conducting PCR are commercially available. Primers useful for amplifying sequences from 
the HBM region are preferably complementary to, and hybridize specifically to sequences in 
the HBM region or in regions that flank a target region therein. HBM sequences generated 
by amplification may be sequenced directly. Alternatively, the amplified sequence(s) may be 
cloned prior to sequence analysis. 

"Antibodies" may refer to polyclonal and/or monoclonal antibodies and fragments 
thereof, and immunologic binding equivalents thereof, that can bind to the HBM proteins and 
fragments thereof or to nucleic acid sequences from the HBM region, particularly from the 
HBM locus or a portion thereof. Preferred antibodies also include those capable of binding to 
LRP5, LRP6 and HBM variants. The term antibody is used both to refer to a homogeneous 
molecular entity, or a mixture such as a serum product made up of a plurality of different 
molecular entities. Proteins may be prepared synthetically in a protein synthesizer and 
coupled to a carrier molecule and injected over several months into rabbits. Rabbit sera is 
tested for immunoreactivity to the HBM protein or fragment. Monoclonal antibodies may be 
made by injecting mice with the proteins, or fragments thereof. Monoclonal antibodies will 
be screened by ELISA and tested for specific immunoreactivity with HBM protein or
fragments thereof. See Harlow et al., Antibodies: A Laboratory Manual, Cold Spring Harbor
Laboratory, Cold Spring Harbor, NY (1988) and Using Antibodies: A Laboratory Manual,
Harlow, Ed and Lane, David (Cold Spring Harbor Press, 1999). These antibodies will be
useful in assays as well as pharmaceuticals. By "antibody" is meant to include, but is not
limited to, polyclonal, monoclonal, chimeric, human, humanized, bispecific, multispecific,
and primatized™ antibodies.

"HBM protein" refers to a protein that is identical to a Zmax1 (LRP5) protein except
that it contains an alteration of glycine 171 to a valine. An HBM protein is defined for any
organism that encodes a Zmax1 (LRP5) true homolog. For example, a mouse HBM protein
refers to the mouse Zmax1 (LRP5) protein having the glycine 170 to valine substitution.

By "HBM-like" is meant a variant of LRP5, LRP6 or HBM which when expressed in 
a cell is capable of modulating bone mass, lipid levels, Dkk activity, and/or Wnt activity. 

In one embodiment of the present invention, "HBM gene" refers to the genomic DNA
sequence found in individuals showing the HBM characteristic or phenotype, where the
sequence encodes the protein indicated by SEQ ID NO: 4. The HBM gene and the Zmax1
(LRP5) gene are allelic. The protein encoded by the HBM gene has the property of causing
elevated bone mass, while the protein encoded by the Zmax1 (LRP5) gene does not. The
HBM gene and the Zmax1 (LRP5) gene differ in that the HBM gene has a thymine at position
582, while the Zmax1 gene has a guanine at position 582. The HBM gene comprises the
nucleic acid sequence shown as SEQ ID NO: 2. The HBM gene may also be referred to as an
"HBM polymorphism." Other HBM genes may further have silent mutations, such as those
discussed in Section 3 below.
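
The single-base distinction between the two alleles can be illustrated with a short sketch. The function name `classify_allele`, the assumption that position 582 is counted from 1 along whatever sequence is supplied, and the toy sequences are all illustrative assumptions and are not taken from the specification; a real check would be run against SEQ ID NO: 1 or SEQ ID NO: 2.

```python
# Hypothetical helper: report which allele a sequence carries at position 582.
# Assumes 1-based numbering along the sequence passed in (an assumption).

POLYMORPHIC_POSITION = 582  # position named in the definition above


def classify_allele(sequence: str, position: int = POLYMORPHIC_POSITION) -> str:
    """Return 'HBM' for thymine, 'wild-type' for guanine, otherwise 'other'."""
    if len(sequence) < position:
        raise ValueError("sequence is shorter than the queried position")
    base = sequence[position - 1].upper()  # convert 1-based position to 0-based index
    if base == "T":
        return "HBM"
    if base == "G":
        return "wild-type"
    return "other"


# Toy sequences (not real LRP5 sequence): 581 filler bases, then the test base.
toy_wild_type = "A" * 581 + "G" + "A" * 10
toy_hbm = "A" * 581 + "T" + "A" * 10

print(classify_allele(toy_wild_type))
print(classify_allele(toy_hbm))
```

The sketch only inspects one position; it does not align the input to a reference, so it is meaningful only when the supplied sequence already uses the same numbering as the reference sequence.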

In alternative embodiments of the present invention, "HBM gene" may also refer to
any allelic variant of Zmax1 (LRP5) or LRP6 which results in the HBM phenotype. Such
variants may include alteration from the wild-type protein coding sequence as described
herein and/or alteration in expression control sequences of Zmax1 (LRP5), or may contain an
amino acid mutation in LRP5 or LRP6, such that the resulting protein produces a phenotype
which enhances bone mass and/or modulates lipid levels. A preferred example of such a
variant is an alteration of the endogenous Zmax1 (LRP5) promoter region resulting in
increased expression of the Zmax1 (LRP5) protein.

"Normal," "wild-type," "unaffected", "Zmax1", "Zmax", "LR3" and "LRP5" all refer
to the genomic DNA sequence that encodes the protein indicated by SEQ ID NO: 3. LRP5
has also been referred to as LRP7 in mouse. Zmax1, LRP5 and Zmax may be used
interchangeably throughout the specification and are meant to be the same gene, perhaps only
relating to the gene in a different organism. The Zmax1 gene has a guanine at position 582 in
the human sequence. The Zmax1 gene of human comprises the nucleic acid sequence shown
as SEQ ID NO: 1. "Normal," "wild-type," "unaffected", "Zmax1" and "LRP5" also refer to
allelic variants of the genomic sequence that encodes proteins that do not contribute to
elevated bone mass. The Zmax1 (LRP5) gene is common in the human population, while the
HBM gene is rare.

"5YWTD+EGF" refers to a repeat unit found in the Zmax1 (LRP5) protein, consisting
of five YWTD repeats followed by an EGF repeat.

"Bone development" generally refers to any process involved in the change of bone 
over time, including, for example, normal development, changes that occur during disease 
states, and changes that occur during aging. This may refer to structural changes in and 
dynamic rate changes such as growth rates, resorption rates, bone repair rates, etc. "Bone
development disorder" particularly refers to any disorders in bone development including, for 
example, changes that occur during disease states and changes that occur during aging. Bone 
development may be progressive or cyclical in nature. Aspects of bone that may change 
during development include, for example, mineralization, formation of specific anatomical 
features, and relative or absolute numbers of various cell types. 

"Bone modulation" or "modulation of bone formation" refers to the ability to affect 
any of the physiological processes involved in bone remodeling, as will be appreciated by one 
skilled in the art, including, for example, bone resorption and appositional bone growth, by, 
inter alia, osteoclastic and osteoblastic activity, and may comprise some or all of bone 
formation and development as used herein. 

Bone is a dynamic tissue that is continually adapting and renewing itself through the 
removal of old or unnecessary bone by osteoclasts and the rebuilding of new bone by 
osteoblasts. The nature of the coupling between these processes is responsible both for the 
modeling of bone during growth as well as the maintenance of adult skeletal integrity through 
remodeling and repair to meet the everyday needs of mechanical usage. There are a number 
of diseases of bone that result from an uncoupling of the balance between bone resorption and 
formation. With aging there is a gradual "physiologic" imbalance in bone turnover, which is
particularly exacerbated in women due to menopausal loss of estrogen support, that leads to a
progressive loss of bone. The reduction in bone mass and deterioration in bone architecture
result in an increase in bone fragility and susceptibility to spontaneous fractures. For every
10 percent of bone that is lost the risk of fracture doubles. Individuals with bone mineral
density (BMD) in the spine or proximal femur 2.5 or more standard deviations below normal
peak bone mass are classified as osteoporotic. However, osteopenic individuals with BMD 
between 1 and 2.5 standard deviations below the norm are clearly at risk of suffering bone 
loss related disorders. 
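
The thresholds and the doubling rule stated above can be written out as a small sketch. The function names, the treatment of the boundary values (a score exactly 1 or 2.5 standard deviations below the norm), the label "unclassified" for scores above the osteopenic range, and the reading of the doubling rule as a compounding factor of 2 per 10 percent lost are all assumptions made for illustration; the specification does not define them.

```python
# Illustrative sketch only; names and boundary handling are assumptions.

def classify_bmd(sd_from_peak: float) -> str:
    """Classify a BMD value given in standard deviations relative to normal
    peak bone mass (negative values are below the norm)."""
    if sd_from_peak <= -2.5:
        return "osteoporotic"   # 2.5 or more SD below normal peak bone mass
    if sd_from_peak <= -1.0:
        return "osteopenic"     # between 1 and 2.5 SD below the norm
    return "unclassified"       # the text gives no label for this range


def relative_fracture_risk(percent_bone_lost: float) -> float:
    """Risk multiplier if risk doubles for every 10 percent of bone lost,
    assuming the doubling compounds."""
    return 2.0 ** (percent_bone_lost / 10.0)


print(classify_bmd(-3.0))
print(classify_bmd(-1.8))
print(relative_fracture_risk(20.0))
```

Under the compounding assumption, a 20 percent loss corresponds to a fourfold risk; a different reading of the doubling rule would give different values beyond the first 10 percent.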

Bone modulation may be assessed by measuring parameters such as bone mineral
density (BMD) and bone mineral content (BMC) by pDXA X-ray methods, bone size,
thickness or volume as measured by X-ray, bone formation rates as measured for example by
calcein labeling, total, trabecular, and mid-shaft density as measured by pQCT and/or mCT
methods, connectivity and other histological parameters as measured by mCT methods, and
mechanical bending and compressive strengths as preferably measured in femur and
vertebrae respectively. Due to the nature of these measurements, each may be more or less
appropriate for a given situation as the skilled practitioner will appreciate. Furthermore,
parameters and methodologies such as a clinical history of freedom from fracture, bone
shape, bone morphology, connectivity, normal histology, fracture repair rates, and other bone
quality parameters are known and used in the art. Most preferably, bone quality may be
assessed by the compressive strength of vertebra when such a measurement is appropriate.
Bone modulation may also be assessed by rates of change in the various parameters. Most 
preferably, bone modulation is assessed at more than one age. 

"Normal bone density" refers to a bone density within two standard deviations of a Z 
score of 0 in the context of the HBM linkage study. In a general context, the range of normal
bone density parameters is determined by routine statistical methods. A normal parameter is
within about 1 or 2 standard deviations of the age and sex normalized parameter, preferably
about 2 standard deviations. A statistical measure of meaningfulness is the P value which can
represent the likelihood that the associated measurement is significantly different from the
mean. Significant P values are P < 0.05, 0.01, 0.005, and 0.001, preferably at least P < 0.01.

"HBM" refers to "high bone mass" although this term may also be expressed in terms
of bone density, mineral content, and size.

The "HBM phenotype" and "HBM-like phenotype" may be characterized by an 
increase of about 2 or more standard deviations, preferably 2, 2.5, 3, or more standard 
deviations in 1, 2, 3, 4, 5, or more quantitative parameters of bone modulation, preferably 
bone density and mineral content and bone strength parameters, above the age and sex norm 
for that parameter. The HBM phenotype and HBM-like phenotype are characterized by 
statistically significant increases in at least one parameter, preferably at least 2 parameters, 
and more preferably at least 3 or more parameters. The HBM phenotype and the HBM-like 
phenotype may also be characterized by an increase in one or more bone quality parameters 
and most preferably increasing parameters are not accompanied by a decrease in any bone 
quality parameters. Most preferably, an increase in bone modulation parameters and/or bone 
quality measurements is observed at more than one age. The HBM phenotype and HBM-like
phenotype also include changes of lipid levels, Wnt activity and/or Dkk activity.
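
The standard-deviation criterion in this definition can be sketched as a simple count of elevated parameters. The function names, the parameter labels, and the example Z scores are hypothetical and invented for illustration. The sketch covers only the "about 2 or more standard deviations above the age and sex norm" test; it does not model statistical significance, bone quality, lipid levels, or Wnt or Dkk activity.

```python
from typing import List, Mapping

# Illustrative sketch only; names, labels, and values are assumptions.


def elevated_parameters(z_scores: Mapping[str, float],
                        threshold: float = 2.0) -> List[str]:
    """Return the names of parameters whose age- and sex-normalized Z score
    is at or above the threshold, sorted alphabetically."""
    return sorted(name for name, z in z_scores.items() if z >= threshold)


def meets_sd_criterion(z_scores: Mapping[str, float],
                       threshold: float = 2.0,
                       min_parameters: int = 1) -> bool:
    """True if at least `min_parameters` parameters are elevated."""
    return len(elevated_parameters(z_scores, threshold)) >= min_parameters


# Made-up example values, not data from the specification.
example = {"spine BMD": 3.1, "femur BMD": 2.4, "total body BMC": 1.2}

print(elevated_parameters(example))
print(meets_sd_criterion(example, min_parameters=2))
print(meets_sd_criterion(example, min_parameters=3))
```

Raising `min_parameters` to 2 or 3 corresponds to the "preferably at least 2" and "more preferably at least 3" tiers described above.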

A "Zmax1 system" or "LRP5 system" refers to a purified protein, cell extract, cell,
animal, human or any other composition of matter in which Zmax1 (LRP5) is present in a
normal or mutant form. 

The terms "isolated" and "purified" refer to a substance altered by hand of man from 
the natural environment. An isolated peptide may be for example in a substantially pure form 
or otherwise displaced from its native environment such as by expression in an isolated cell 
line or transgenic animal. An isolated sequence may for example be a molecule in 
substantially pure form or displaced from its native environment such that at least one end of 
said isolated sequence is not contiguous with the sequence it would be contiguous with in 
nature. 

A "surrogate marker" refers to a diagnostic indication, symptom, sign or other feature 
that can be observed in a cell, tissue, human or animal that is correlated with the HBM gene 
or elevated bone mass or both, but that is easier to measure than bone density. The general 
concept of a surrogate marker is well accepted in diagnostic medicine. 

The present invention encompasses the Zmax1 (LRP5) gene and Zmax1 (LRP5)
protein in the forms indicated by SEQ ID NOS: 1 and 3, respectively, and other closely
related variants, as well as the adjacent chromosomal regions of Zmax1 (LRP5) necessary for
its accurate expression. In a preferred embodiment, the present invention is directed to at
least 15 contiguous nucleotides of the nucleic acid sequence of SEQ ID NO: 1.

The present invention further encompasses variants of the LRP6 gene and its 
corresponding protein which result in an enhanced bone mass and/or modulate lipids and/or 
modulate the Wnt signaling pathway.

"Biologically active" refers to those forms of proteins and polypeptides, including 
conservatively substituted variants, alleles of genes encoding a protein or polypeptide, and
fragments of proteins which retain a biological and/or immunological activity of the
wild-type protein or polypeptide. Preferably the activity is one which induces a change in LRP5,
LRP6, or Dkk activity, such as inhibiting the interaction of LRP5 or LRP6 or variants thereof
with Dkk, or Dkk with another ligand binding partner (e.g., Dkk-1 with a Dkk-1 interacting
protein such as those shown in Fig. 31). "Biologically active" is also meant to include any
form which modulates Wnt signaling.

By "modulate" and "regulate" is meant methods, conditions, or agents which increase
or decrease the wild-type activity of an enzyme, inhibitor, signal transducer, receptor,
transcription activator, co-factor, and the like. This change in activity can be an increase or
decrease of mRNA translation, mRNA or DNA transcription, and/or mRNA or protein
degradation, which may in turn correspond to an increase or decrease in biological activity.

By "modulated activity" and "regulated activity" is meant any activity, condition,
disease or phenotype which is modulated by a biologically active form of a protein.
Modulation may be effected by affecting the concentration or subcellular localization of
biologically active protein, i.e., by regulating expression or degradation, or by direct agonistic
or antagonistic effect as, for example, through inhibition, activation, binding, or release of
substrate, modification either chemically or structurally, or by direct or indirect interaction
which may involve additional factors.

By "effective amount" or "dose effective amount" or "therapeutically effective 
amount" is meant an amount of an agent which modulates a biological activity of the 
polypeptide of the invention. 

By "immunologically active" is meant any immunoglobulin protein or fragment
thereof which recognizes and binds to an antigen.

By "Dkk" is meant to refer to the nucleic acids and proteins of members of the Dkk
(Dickkopf) family. This includes, but is not limited to, Dkk-1, Dkk-2, Dkk-3, Dkk-4, Soggy,
and related Dkk proteins. Dkk-1 is a preferred embodiment of the present invention.
However, the Dkk proteins have substantial homology and one skilled in the art will
appreciate that all of the embodiments of the present invention utilizing Dkk-1 may also be
utilized with the other Dkk proteins.

By "Dkk-1" is meant to refer to the Dkk-1 protein and nucleic acids which encode the
Dkk-1 protein. Dkk-1 refers to Dickkopf-1, and in Xenopus it is related to at least Dkk-2,
Dkk-3, and Dkk-4 (see Krupnik et al., 1999 Gene 238: 301-313). Dkk-1 was first identified
in Xenopus (Glinka et al., 1998 Nature 391: 357-62). It was recognized as a factor capable of
inducing ectopic head formation in the presence of inhibition of the BMP pathway. It was
then also found to inhibit the axis-inducing activity of several Xenopus Wnt molecules by
acting as an extracellular antagonist of Wnt signaling. Mammalian homologs have been
found including Dkk-1, Dkk-2, Dkk-3, Dkk-4 and soggy (Fedi et al., 1999 J. Biol. Chem.
274: 19465-72; and Krupnik et al., 1999). Human Dkk-1 was also referred to as sk (Fedi et
al., 1999). As used herein, Dkk-1 is meant to include proteins from any species having a Wnt
pathway in which Dkk-1 interacts. Particularly preferred are mammalian species (e.g.,
murine, caprine, canine, bovine, feline, equine, primate, ovine, porcine and the like), with
particularly preferred mammals being humans. Nucleic acid sequences encoding Dkk-1
include, but are not limited to human Dkk-1 (GenBank Accession Nos. AH009834,
XM_005730, AF261158, AF261157, AF177394, AF127563 and NM_012242), Mus
musculus dickkopf homolog 1 (GenBank Accession No. NM_010051), and Danio rerio
dickkopf-1 (GenBank Accession Nos. AF116852 and AB023488). The genomic sequences
with exon annotation are GenBank Accession Nos. AF261157 and AF261158. Also
contemplated are homologs of these sequences which have Dkk-1 activity in the Wnt
pathway. Dkk-1 amino acid sequences include, but are not limited to human dickkopf
homolog 1 (GenBank Accession Nos. AAG15544, BAA34651, NP_036374, AAF02674,
AAD21087, and XP_005730), Danio rerio (zebrafish) dickkopf1 (GenBank Accession Nos.
BAA82135 and AAD22461) and murine dickkopf-1 (GenBank Accession Nos. O54908 and
NP_034181). Variants and homologs of these sequences which possess Dkk-1 activity are
also included when referring to Dkk-1.

By "LRP5 mediated", "LRP6 mediated", and "Dkk mediated" disorder, condition or
disease is meant any abnormal state that involves LRP5, LRP6 and/or Dkk activity. The abnormal
state can be induced by environmental exposure or drug administration. Alternatively, the
disease or disorder can be due to a genetic defect. Dkk mediated diseases, disorders and
conditions include but are not limited to bone mass disorders or conditions and lipid disorders
and conditions. For example, bone mass disorders/conditions/diseases, which may be
mediated by Dkk, LRP5 and/or LRP6 include but are not limited to age related loss of bone,
bone fractures (e.g., hip fracture, Colles' fracture, vertebral crush fractures),
chondrodystrophies, drug-induced disorders (e.g., osteoporosis due to administration of
glucocorticoids or heparin and osteomalacia due to administration of aluminum hydroxide,
anticonvulsants, or glutethimide), high bone turnover, hypercalcemia, hyperostosis,
osteoarthritis, osteogenesis imperfecta, osteomalacia, osteomyelitis, osteoporosis, Paget's
disease, and rickets.

Lipid disorders/diseases/conditions, which may be mediated by Dkk, LRP5, and/or
LRP6 include but are not limited to familial lipoprotein lipase deficiency, familial apoprotein
CII deficiency, familial type 3 hyperlipoproteinemia, familial hypercholesterolemia, familial
hypertriglyceridemia, multiple lipoprotein-type hyperlipidemia, elevated lipid levels due to
dialysis and/or diabetes, and elevated lipid levels of unknown etiologies.

The term "recognizes and binds," when used to define interactions of antisense
nucleotides or siRNAs (small inhibitory RNAs) with a target sequence, means that a particular
antisense or small inhibitory RNA (siRNA) sequence is substantially complementary to the
target sequence, and thus will specifically bind to a portion of an mRNA encoding a
polypeptide. As such, typically the sequences will be highly complementary to the mRNA
target sequence, and will have no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 base mismatches
throughout the sequence. In many instances, it may be desirable for the sequences to be exact
matches, i.e., be completely complementary to the sequence to which the oligonucleotide
specifically binds, and therefore have zero mismatches along the complementary stretch. As
such, highly complementary sequences will typically bind quite specifically to the target
sequence region of the mRNA and will therefore be highly efficient in reducing, and/or even
inhibiting the translation of the target mRNA sequence into polypeptide product.

Substantially complementary oligonucleotide sequences will be greater than about 80
percent complementary (or "% exact-match") to the corresponding mRNA target sequence to
which the oligonucleotide specifically binds, and will, more preferably be greater than about
85 percent complementary to the corresponding mRNA target sequence to which the
oligonucleotide specifically binds. In certain aspects, as described above, it will be desirable
to have even more substantially complementary oligonucleotide sequences for use in the
practice of the invention, and in such instances, the oligonucleotide sequences will be greater
than about 90 percent complementary to the corresponding mRNA target sequence to which
the oligonucleotide specifically binds, and may in certain embodiments be greater than about
95 percent complementary to the corresponding mRNA target sequence to which the
oligonucleotide specifically binds, and even up to and including 96%, 97%, 98%, 99%, and
even 100% exact match complementary to the target mRNA to which the designed
oligonucleotide specifically binds.
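
The percent-complementary thresholds above can be illustrated with a short sketch that compares an antisense oligonucleotide with an equal-length stretch of its mRNA target. The helper names and the 20-nucleotide target used here are hypothetical; ungapped, position-by-position comparison is an assumption made for simplicity.

```python
COMPLEMENT = {"A": "U", "C": "G", "G": "C", "U": "A"}


def reverse_complement_rna(seq):
    """Perfect antisense sequence (5'->3') for an RNA or DNA target written 5'->3'."""
    rna = seq.upper().replace("T", "U")
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def percent_complementary(oligo, target_site):
    """Percent of oligo positions that base-pair with an equal-length target site."""
    oligo = oligo.upper().replace("T", "U")
    if len(oligo) != len(target_site):
        raise ValueError("oligo and target site must have the same length")
    perfect = reverse_complement_rna(target_site)
    matches = sum(1 for a, b in zip(oligo, perfect) if a == b)
    return 100.0 * matches / len(oligo)


# Hypothetical 20-nucleotide target site (not taken from any SEQ ID NO)
target = "AUGGCUACGUUAGCCGAUAC"
exact = reverse_complement_rna(target)
# Introduce two mismatches at the 5' end of the antisense oligonucleotide
two_mismatches = "CA" + exact[2:]
print(percent_complementary(exact, target))
print(percent_complementary(two_mismatches, target))
```

Under this definition an oligonucleotide with 2 mismatches over 20 positions is 90 percent complementary, and one with zero mismatches is a 100% exact match.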

Percent similarity or percent complementary of any of the disclosed sequences may be
determined, for example, by comparing sequence information using the GAP computer
program, version 6.0, available from the University of Wisconsin Genetics Computer Group
(UWGCG). The GAP program utilizes the alignment method of Needleman and Wunsch
(1970). Briefly, the GAP program defines similarity as the number of aligned symbols (i.e.,
nucleotides or amino acids) which are similar, divided by the total number of symbols in the
shorter of the two sequences. The preferred default parameters for the GAP program include:
(1) a unary comparison matrix (containing a value of 1 for identities and 0 for non-identities)
for nucleotides, and the weighted comparison matrix of Gribskov and Burgess (1986); (2) a
penalty of 3.0 for each gap and an additional 0.10 penalty for each symbol in each gap; and
(3) no penalty for end gaps.
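
The similarity definition and default penalties just described can be sketched as follows for a nucleotide alignment that has already been computed. This is not the GAP program itself and performs no alignment; it only scores two equal-length gapped strings (with "-" marking a gap) under the stated parameters. The function name and the example alignment are hypothetical.

```python
def gap_style_metrics(aligned_a, aligned_b, gap_open=3.0, gap_extend=0.10):
    """Return (similarity, score) for a precomputed pairwise nucleotide alignment.

    similarity = identical aligned symbols / length of the shorter sequence
    score      = +1 per identity, 0 per non-identity (unary matrix),
                 -gap_open per internal gap, -gap_extend per symbol in each gap,
                 with no penalty for end gaps.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")

    def span(seq):
        idx = [i for i, c in enumerate(seq) if c != "-"]
        if not idx:
            raise ValueError("sequence contains no symbols")
        return idx[0], idx[-1]

    a_first, a_last = span(aligned_a)
    b_first, b_last = span(aligned_b)
    # Columns outside [start, end] are end gaps and are not penalized.
    start, end = max(a_first, b_first), min(a_last, b_last)

    matches, score, gap_owner = 0, 0.0, None
    for i in range(start, end + 1):
        x, y = aligned_a[i], aligned_b[i]
        if x != "-" and y != "-":
            gap_owner = None
            if x == y:
                matches += 1
                score += 1.0
        else:
            owner = "a" if x == "-" else "b"
            if owner != gap_owner:  # a new gap begins
                score -= gap_open
                gap_owner = owner
            score -= gap_extend

    shorter = min(len(aligned_a.replace("-", "")), len(aligned_b.replace("-", "")))
    return matches / shorter, score


similarity, score = gap_style_metrics("ACGTTGCA--", "ACG--GCATT")
print(similarity, round(score, 2))
```

In the example, six identities over a shorter-sequence length of eight give a similarity of 0.75; the single internal two-symbol gap costs 3.0 + 2 x 0.10, and the trailing end gap is free.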

By "mimetic" is meant a compound or molecule that performs the same function or 
behaves similarly to the compound mimicked. 

By "reporter element" is meant a polynucleotide that encodes a polypeptide capable of
being detected in a screening assay. Examples of polypeptides encoded by reporter elements
include, but are not limited to, lacZ, GFP, luciferase, and chloramphenicol acetyltransferase.

2. Introduction

The present invention also encompasses the HBM gene and HBM protein in the forms
indicated by SEQ ID NO: 2 and 4, respectively, and other closely related variants, as well as
the adjacent chromosomal regions of the HBM gene necessary for its accurate expression. In
a preferred embodiment, the present invention is directed to at least 15 contiguous
nucleotides of the nucleic acid sequence of SEQ ID NO: 2. More preferably, the present
invention is directed to at least 15 contiguous nucleotides of the nucleic acid sequence of
SEQ ID NO: 2, wherein one of the 15 contiguous nucleotides is the thymine at nucleotide
582.
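
To make the "15 contiguous nucleotides including nucleotide 582" language concrete, the sketch below enumerates every window of a given length that contains a chosen 1-based position. The sequence used is a hypothetical placeholder, not SEQ ID NO: 2, and the function name is an assumption made for illustration.

```python
def windows_covering(seq, position, length=15):
    """Yield (1-based start, subsequence) for each window of `length` containing `position`."""
    if not 1 <= position <= len(seq):
        raise ValueError("position is outside the sequence")
    idx = position - 1
    first = max(0, idx - length + 1)
    last = min(idx, len(seq) - length)
    for start in range(first, last + 1):
        yield start + 1, seq[start:start + length]


# Placeholder sequence: 600 nucleotides with a thymine at position 582
placeholder = "A" * 581 + "T" + "A" * 18
windows = list(windows_covering(placeholder, 582))
print(len(windows), windows[0][0], windows[-1][0])
```

For a position far enough from either end of the sequence there are exactly 15 such 15-nucleotide windows, starting at positions 568 through 582.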

The invention also relates to the nucleotide sequence of the Zmax1 (LRP5) gene
region, as well as the nucleotide sequence of the HBM region. More particularly, preferred
embodiments are the BAC clones containing segments of the Zmax1 (LRP5) gene region,
B200E21-H and B527D12-H. A preferred embodiment is the nucleotide sequence of the
BAC clones consisting of SEQ ID NOS: 5-12.

The invention also concerns the use of the nucleotide sequence to identify DNA
probes for the Zmax1 (LRP5) gene and the HBM gene, PCR primers to amplify the Zmax1
(LRP5) gene and the HBM gene, nucleotide polymorphisms in the Zmax1 (LRP5) gene and
the HBM gene, and regulatory elements of the Zmax1 (LRP5) gene and the HBM gene.

This invention describes the further localization of the chromosomal location of the
Zmax1 (LRP5) gene and HBM gene on chromosome 11q13.3, between genetic markers
D11S987 and SNP_CONTIG033-6, as well as the DNA sequences of the Zmax1 (LRP5) gene
and the HBM gene. The chromosomal location was refined by the addition of more genetic
markers to the mapping panel used to map the gene, and by the extension of the pedigree to
include more individuals. The pedigree extension was critical because the new individuals
that have been genotyped harbor critical recombination events that narrow the region. To
identify genes in the region on 11q13.3, a set of BAC clones containing this chromosomal
region was identified. The BAC clones served as a template for genomic DNA sequencing,
and also as a reagent for identifying coding sequences by direct cDNA selection. Genomic
sequencing and direct cDNA selection were used to characterize more than 1.5 million base
pairs of DNA from 11q13.3. The Zmax1 (LRP5) gene was identified within this region and
the HBM gene was then discovered after mutational analysis of affected and unaffected
individuals.

When a gene has been genetically localized to a specific chromosomal region, the
genes in this region can be characterized at the molecular level by a series of steps that
include: cloning of the entire region of DNA in a set of overlapping clones (physical
mapping), characterization of genes encoded by these clones by a combination of direct
cDNA selection, exon trapping and DNA sequencing (gene identification), and identification
of mutations in these genes by comparative DNA sequencing of affected and unaffected
members of the HBM kindred (mutation analysis).

Physical mapping is accomplished by screening libraries of human DNA cloned in
vectors that are propagated in E. coli or S. cerevisiae using PCR assays designed to amplify
unique molecular landmarks in the chromosomal region of interest. To generate a physical
map of the HBM candidate region, a library of human DNA cloned in Bacterial Artificial
Chromosomes (BACs) was screened with a set of Sequence Tagged Site (STS) markers that
had been previously mapped to chromosome 11q12-q13 by the efforts of the Human Genome
Project.

STSs are unique molecular landmarks in the human genome that can be assayed by
PCR. Through the combined efforts of the Human Genome Project, the location of thousands
of STSs on the twenty-two autosomes and two sex chromosomes has been determined. For a
positional cloning effort, the physical map is tied to the genetic map because the markers
used for genetic mapping can also be used as STSs for physical mapping. By screening a
BAC library with a combination of STSs derived from genetic markers, genes, and random
DNA fragments, a physical map comprised of overlapping clones representing all of the
DNA in a chromosomal region of interest can be assembled.

BACs are cloning vectors for large (80 kilobase to 200 kilobase) segments of human
or other DNA that are propagated in E. coli. To construct a physical map using BACs, a
library of BAC clones is screened so that individual clones harboring the DNA sequence
corresponding to a given STS or set of STSs are identified. Throughout most of the human
genome, the STS markers are spaced approximately 20 to 50 kilobases apart, so that an
individual BAC clone typically contains at least two STS markers. In addition, the BAC
libraries that were screened contain enough cloned DNA to cover the human genome six
times over. Therefore, an individual STS typically identifies more than one BAC clone. By
screening a six-fold coverage BAC library with a series of STS markers spaced
approximately 50 kilobases apart, a physical map consisting of a series of overlapping BAC
clones, i.e., BAC contigs, can be assembled for any region of the human genome. This map is
closely tied to the genetic map because many of the STS markers used to prepare the physical
map are also genetic markers.
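
The arithmetic behind these statements can be sketched as follows. The genome size of 3 x 10^9 base pairs and the 150 kilobase mean insert size are assumptions chosen for illustration (the insert size lies within the 80 to 200 kilobase range given above); the function names are hypothetical.

```python
import math


def clones_for_coverage(genome_bp, mean_insert_bp, fold_coverage):
    """Approximate number of clones needed for a library of the given fold coverage."""
    return math.ceil(fold_coverage * genome_bp / mean_insert_bp)


def sts_per_clone(mean_insert_bp, sts_spacing_bp):
    """Average number of STS markers expected on one clone."""
    return mean_insert_bp / sts_spacing_bp


GENOME_BP = 3_000_000_000   # assumed approximate human genome size
INSERT_BP = 150_000         # assumed mean BAC insert size

print(clones_for_coverage(GENOME_BP, INSERT_BP, 6))
print(sts_per_clone(INSERT_BP, 50_000))
print(sts_per_clone(INSERT_BP, 20_000))
```

Under these assumptions a six-fold library holds on the order of 120,000 clones, each clone carries roughly 3 to 7 STS markers, and any given STS is expected to lie in about six clones on average, which is why a single STS typically identifies more than one BAC.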

When constructing a physical map, it often happens that there are gaps in the STS
map of the genome that result in the inability to identify BAC clones that are overlapping in a
given location. Typically, the physical map is first constructed from a set of STSs that have
been identified through the publicly available literature and World Wide Web resources. The
initial map consists of several separate BAC contigs that are separated by gaps of unknown
molecular distance. To identify BAC clones that fill these gaps, it is necessary to develop
new STS markers from the ends of the clones on either side of the gap. This is done by
sequencing the terminal 200 to 300 base pairs of the BACs flanking the gap, and developing
a PCR assay to amplify a sequence of 100 or more base pairs. If the terminal sequences are
demonstrated to be unique within the human genome, then the new STS can be used to screen
the BAC library to identify additional BACs that contain the DNA from the gap in the
physical map. To assemble a BAC contig that covers a region the size of the HBM candidate
region (2,000,000 or more base pairs), it is often necessary to develop new STS markers from
the ends of several clones.

After building a BAC contig, this set of overlapping clones serves as a template for
identifying the genes encoded in the chromosomal region. Gene identification can be
accomplished by many methods. Three methods are commonly used: (1) a set of BACs
selected from the BAC contig to represent the entire chromosomal region can be sequenced,
and computational methods can be used to identify all of the genes, (2) the BACs from the
BAC contig can be used as a reagent to clone cDNAs corresponding to the genes encoded in
the region by a method termed direct cDNA selection, or (3) the BACs from the BAC contig
can be used to identify coding sequences by selecting for specific DNA sequence motifs in a
procedure called exon trapping. The present invention includes genes identified by the first
two methods.

To sequence the entire BAC contig representing the HBM candidate region, a set of
BACs was chosen for subcloning into plasmid vectors and subsequent DNA sequencing of
these subclones. Since the DNA cloned in the BACs represents genomic DNA, this
sequencing is referred to as genomic sequencing to distinguish it from cDNA sequencing. To
initiate the genomic sequencing for a chromosomal region of interest, several
non-overlapping BAC clones are chosen. DNA for each BAC clone is prepared, and the clones
are sheared into random small fragments which are subsequently cloned into standard
plasmid vectors such as pUC18. The plasmid clones are then grown to propagate the smaller
fragments, and these are the templates for sequencing. To ensure adequate coverage and
sequence quality for the BAC DNA sequence, sufficient plasmid clones are sequenced to
yield six-fold coverage of the BAC clone. For example, if the BAC is 100 kilobases long,
then phagemids are sequenced to yield 600 kilobases of sequence. Since the BAC DNA was
randomly sheared prior to cloning in the phagemid vector, the 600 kilobases of raw DNA
sequence can be assembled by computational methods into overlapping DNA sequences
termed sequence contigs. For the purposes of initial gene identification by computational
methods, six-fold coverage of each BAC is sufficient to yield ten to twenty sequence contigs
of 1000 base pairs to 20,000 base pairs.
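
The six-fold coverage example above can be worked through numerically. The sketch below computes the number of sequencing reads needed and uses the standard Poisson approximation for random shotgun sequencing, under which the fraction of bases covered by no read at c-fold coverage is about e^(-c). The 500 base pair read length is an assumption chosen for illustration and is not stated in this disclosure; the function names are hypothetical.

```python
import math


def reads_for_coverage(target_bp, fold_coverage, read_length_bp):
    """Number of reads needed to generate fold_coverage x target_bp of raw sequence."""
    return math.ceil(target_bp * fold_coverage / read_length_bp)


def expected_uncovered_fraction(fold_coverage):
    """Poisson estimate of the fraction of bases not covered by any read."""
    return math.exp(-fold_coverage)


BAC_BP = 100_000        # the 100 kilobase BAC from the example above
READ_BP = 500           # assumed average read length

reads = reads_for_coverage(BAC_BP, 6, READ_BP)
uncovered_bp = expected_uncovered_fraction(6) * BAC_BP
print(reads, round(uncovered_bp))
```

With these assumptions about 1,200 reads supply the 600 kilobases of raw sequence, and only a few hundred bases of the BAC are expected to remain unsequenced, consistent with the small number of sequence gaps left in each BAC.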

The sequencing strategy employed in this invention was to initially sequence "seed"
BACs from the BAC contig in the HBM candidate region. The sequence of the "seed" BACs
was then used to identify minimally overlapping BACs from the contig, and these were
subsequently sequenced. In this manner, the entire candidate region was sequenced, with
several small sequence gaps left in each BAC. This sequence served as the template for
computational gene identification. One method for computational gene identification is to
compare the sequence of the BAC contig to publicly available databases of cDNA and genomic
sequences, e.g., UniGene, dbEST, GenBank. These comparisons are typically done using the
BLAST family of computer algorithms and programs (Altschul et al., J. Mol. Biol. 215:
403-410 (1990)). The BAC sequence can also be translated into protein sequence, and the protein
sequence can be used to search publicly available protein databases, using a version of
BLAST designed to analyze protein sequences (Altschul et al., 1997 Nucl. Acids Res. 25:
3389-3402). Another method is to use computer algorithms such as MZEF (Zhang, 1997
Proc. Natl. Acad. Sci. USA 94: 565-568) and GRAIL (Uberbacher et al., 1996 Methods
Enzymol. 266: 259-281), which predict the location of exons in the sequence based on the
presence of specific DNA sequence motifs that are common to all exons, as well as the
presence of codon usage typical of human protein encoding sequences.
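
The step of translating genomic sequence into protein sequence before a protein database search can be sketched as a six-frame conceptual translation using the standard genetic code. This is an illustrative sketch only: the function names are hypothetical, the 12-nucleotide input is a made-up example, and the database search itself is not shown.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(dna):
    """Reverse complement of a DNA sequence."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna):
    """Translate from the first base; unknown codons become 'X', stops become '*'."""
    dna = dna.upper()
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3))


def six_frame_translations(dna):
    """Map frame labels (+1, +2, +3, -1, -2, -3) to conceptual translations."""
    frames = {}
    for strand, seq in (("+", dna.upper()), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:])
    return frames


# Made-up fragment whose +1 frame encodes the tetrapeptide YWTD
for frame, protein in six_frame_translations("TATTGGACCGAT").items():
    print(frame, protein)
```

Each of the six translations could then be submitted as a query to a protein-level search, since the coding strand and reading frame of an uncharacterized genomic fragment are not known in advance.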

In addition to identifying genes by computational methods, genes were also identified
by direct cDNA selection (Del Mastro et al., 1995 Genome Res. 5(2): 185-194). In direct
cDNA selection, cDNA pools from tissues of interest are prepared, and the BACs from the
candidate region are used in a liquid hybridization assay to capture cDNA which base-pairs to
coding regions in the BAC. In the methods described herein, the cDNA pools were created
from several different tissues by random priming the first strand cDNA from poly-A RNA,
synthesizing the second strand cDNA by standard methods, and adding linkers to the ends of
the cDNA fragments. The linkers are used to amplify the cDNA pools. The BAC clones are
used as a template for in vitro DNA synthesis to create a biotin labeled copy of the BAC
DNA. The biotin labeled copy of the BAC DNA is then denatured and incubated with an
excess of the PCR amplified, linkered cDNA pools which have also been denatured. The
BAC DNA and cDNA are allowed to anneal in solution, and heteroduplexes between the
BAC and the cDNA are isolated using streptavidin coated magnetic beads. The cDNA which
is captured by the BAC is then amplified using primers complementary to the linker
sequences, and the hybridization/selection process is repeated for a second round. After two
rounds of direct cDNA selection, the cDNA fragments are cloned, and a library of these
direct selected fragments is created.

The cDNA clones isolated by direct selection are analyzed by two methods. Since a
pool of BACs from the HBM candidate region is used to provide the genomic DNA
sequence, the cDNAs must be mapped to individual BACs. This is accomplished by arraying
the BACs in microtiter dishes, and replicating their DNA in high density grids. Individual
cDNA clones are then hybridized to the grid to confirm that they have sequence identity to an
individual BAC from the set used for direct selection, and to determine the specific identity
of that BAC. cDNA clones that are confirmed to correspond to individual BACs are
sequenced. To determine whether the cDNA clones isolated by direct selection share
sequence identity or similarity to previously identified genes, the DNA and protein coding
sequences are compared to publicly available databases using the BLAST family of
programs.

The combination of genomic DNA sequence and cDNA sequence provided by BAC 
sequencing and by direct cDNA selection yields an initial list of putative genes in the region. 
The genes in the region were all candidates for the HBM locus. To further characterize each 
gene, Northern blots were performed to determine the size of the transcript corresponding to 
each gene, and to determine which putative exons were transcribed together to make an 
individual gene. For Northern blot analysis of each gene, probes were prepared from direct 
selected cDNA clones or by PCR amplifying specific fragments from genomic DNA or from 
the BAC encoding the putative gene of interest. The Northern blots gave information on the 
size of the transcript and the tissues in which it was expressed. For transcripts which were 
not highly expressed, it was sometimes necessary to perform a reverse transcription PCR 
assay using RNA from the tissues of interest as a template for the reaction. 

Gene identification by computational methods and by direct cDNA selection provides 
unique information about the genes in a region of a chromosome. When genes are identified, 
then it is possible to examine different individuals for mutations in each gene. 

The present invention also encompasses the HBM gene and HBM protein in the forms
indicated by SEQ ID NO: 2 and 4, respectively, and other closely related variants, as well as
the adjacent chromosomal regions of the HBM gene necessary for its accurate expression. In
a preferred embodiment, the present invention is directed to an isolated nucleic acid sequence
of SEQ ID NO: 2, as well as variants thereof. Variants include changes in SEQ ID NO: 1
which result in an HBM-like phenotype. Examples of such variants are discussed further in
Section 3 below and in the examples. These variants preferably have at least about 90%,
preferably at least about 95%, or more preferably at least about 98% or more similarity or
identity to the nucleic acid sequence of SEQ ID NOS: 1 or 2 or biologically active fragments
thereof. Therefore, sequences which are 96%, 97%, and 99% or more similar to SEQ ID
NOS: 1 or 2 or biologically active fragments thereof are also contemplated herein.

The degree of variation of a high bone mass (HBM) variant can be determined
using BLAST, FASTA or another suitable algorithm with standard default
parameters. Preferably, identity will be determined for coding regions of SEQ ID NOS: 1-2,
but can also include non-coding domains. Additionally, alignment programs can be used to
identify conserved sequences or potential motifs across different animal species. Alignment
programs can also be used to align the nucleic acid and/or protein sequences of related genes
and the proteins that they encode. Preferred alignment programs include CLUSTALW,
PILEUP and GAP, and would preferably be used with default parameters. For example, such
programs can be used to align the sequences of Zmax1 (LRP5), HBM, and LDL
receptor-related protein 6 (LRP6) and sequences related thereto.

By a polynucleotide having a nucleotide sequence at least, for example, 90% "similar"
to a reference nucleotide sequence encoding a polypeptide, is intended that the nucleotide
sequence of the polynucleotide is identical to the reference sequence except that the
polynucleotide sequence may include up to ten point mutations per each 100 nucleotides of
the reference nucleotide sequence. These mutations of the reference sequence may occur at
any location in SEQ ID NO: 1 or 2 or in the LRP6 gene. The mutations may be silent.

Another embodiment contemplates that such polynucleotide variants of SEQ ID NO:
1 or 2 comprise nucleic acid sequences which are at least 15, 20, 25, 30, 35, 40, 45, 50, 60,
70, 80, 90, 100, 150, 200, 300, 400, or 500 contiguous nucleotides of SEQ ID NO: 1 or 2.
More preferably, such polynucleotide variants have a contiguous nucleic acid sequence
corresponding with the polymorphism at nucleotide 582 (G-T substitution) of SEQ ID NO: 2
or other variants of SEQ ID NO: 1 or 2, which comprise a mutation which modulates bone
mass and/or lipid levels when the polypeptide encoded thereby is administered to a subject.
All variants of SEQ ID NO: 1 or 2 contemplated possess the characteristic of encoding a
protein or polypeptide which when administered to a subject induces bone modulation.
Additional variants which may be responsible for modulating bone mass when administered
to a subject may lie within the domain known to contain the HBM polymorphism and which
encodes the beta propeller domain (YWTD motifs). Alternatively, other variants of Zmax1
(LRP5) which modulate high bone mass in a subject may be due to mutations in the nucleic
acid sequences encoding any of the other conserved domains of Zmax1 (LRP5), such as those
set forth in Figure 4 (e.g., the RGD extracellular attachment site, the binding site for LDL and
calcium, the cysteine rich growth factor repeats, the ideal PEST region, and the
internalization domain). See Section 3 and the Examples below for additional mutations
which may confer enhanced bone mass and/or lipid modulation.

HBM polynucleotides and HBM-like variants contemplated include those which
hybridize under stringent conditions to SEQ ID NO: 2. Hybridization methods are known in
the art and include, but are not limited to: (a) washing with 0.1X SSPE (0.62 M NaCl, 0.06 M
NaH2PO4·H2O, 0.075 M EDTA, pH 7.4) and 0.1% sodium dodecyl sulfate (SDS) at 50°C; (b)
washing with 50% formamide, 5X SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM
sodium phosphate (pH 6-8), 0.1% sodium pyrophosphate, 5X Denhardt's solution, sonicated
salmon sperm DNA (50 μg/ml), 0.1% SDS and 10% dextran sulfate at 42°C, followed by
washing at 42°C in 0.2X SSC and 0.1% SDS; and (c) washing with 0.5 M NaPO4, 7%
SDS at 65°C followed by washing at 60°C in 0.5X SSC and 0.1% SDS. Additional
conditions under which HBM variants can be isolated by hybridization to SEQ ID NO: 2 or
nucleic acid fragments thereof can be obtained by varying the hybridization temperature.
High stringency hybridization conditions are those performed at about 20°C below the
melting temperature (Tm) of SEQ ID NO: 2 or fragments thereof. Preferred stringency is
performed at about 5-10°C below the Tm of SEQ ID NO: 2 or fragments thereof. Additional
hybridization conditions can be prepared as described in Chapter 11 of Sambrook et al.,
Molecular Cloning: A Laboratory Manual (1989), or as would be known to the artisan of
ordinary skill.
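
To illustrate how a hybridization temperature might be chosen relative to the Tm, the sketch below uses one commonly cited empirical approximation for DNA duplexes, Tm = 81.5 + 16.6 log10[Na+] + 0.41(%GC) - 675/N. This formula is an assumption brought in for illustration and is not taken from this disclosure; it ignores formamide and mismatches, both of which lower the Tm. The probe sequence and function names are hypothetical.

```python
import math


def estimate_tm_celsius(seq, na_molar):
    """Empirical Tm estimate: 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 675/N."""
    seq = seq.upper()
    n = len(seq)
    if n == 0 or na_molar <= 0:
        raise ValueError("sequence must be non-empty and [Na+] positive")
    gc_percent = 100.0 * (seq.count("G") + seq.count("C")) / n
    return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * gc_percent - 675.0 / n


def hybridization_temperature(tm_celsius, degrees_below_tm):
    """Temperature a given number of degrees below the estimated Tm."""
    return tm_celsius - degrees_below_tm


# Hypothetical 200-nucleotide probe with 50% GC content at 1.0 M Na+
probe = "ACGT" * 50
tm = estimate_tm_celsius(probe, na_molar=1.0)
print(tm)
print(hybridization_temperature(tm, 20))   # about 20 degrees below Tm
print(hybridization_temperature(tm, 5), hybridization_temperature(tm, 10))
```

In practice the Tm for a particular probe and buffer would be confirmed empirically or with a more detailed model before fixing hybridization and wash temperatures.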

Alternatively, mammalian libraries (e.g., equine, primate, caprine, bovine, ovine,
feline, porcine, and canine) can be probed using degenerate primers and polymerase chain
reaction (PCR) techniques to identify variants of SEQ ID NO: 2 or fragments thereof.
Preferably primers are utilized which hybridize under stringent conditions to the open reading
frame of SEQ ID NO: 2, or to non-coding portions of the sequence. More preferably, such
primers hybridize to conserved domains within SEQ ID NO: 2. For example, conserved
domains include those coding for the YWTD beta-propeller domains or other domains, such
as those listed in Figure 4. Preferred primers are typically 15 nucleotides in length, but can
be at least about 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40 or 50 (and any range
in between) nucleotides in length. The target gene or nucleic acid sequence can thus be
amplified from a heterologous species using degenerate PCR primers. Probes for variants of
SEQ ID NO: 2 and the polypeptide encoded thereby can be obtained by preparing mixed
oligonucleotides of greater than 10, preferably of 15 or more, nucleotides in length
representing all possible nucleotide sequences which could encode the corresponding amino
acid sequences (e.g., SEQ ID NO: 4 or fragments thereof). This method is documented by
Gould et al., 1989 Proc. Natl. Acad. Sci. USA 86(6): 1934-8.
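The idea of mixed oligonucleotides "representing all possible nucleotide sequences which could encode" a given amino acid sequence can be illustrated with a short sketch. It assumes the standard genetic code and uses the conserved YWTD motif mentioned above as the example peptide; the function name is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each amino acid (one-letter code) to the list of codons encoding it.
CODONS = {}
for (b1, b2, b3), aa in zip(product(BASES, repeat=3), AMINO):
    CODONS.setdefault(aa, []).append(b1 + b2 + b3)


def mixed_oligos(peptide: str) -> list:
    """Return every nucleotide sequence that encodes the given peptide."""
    choices = [CODONS[aa] for aa in peptide.upper()]
    return ["".join(codons) for codons in product(*choices)]


if __name__ == "__main__":
    oligos = mixed_oligos("YWTD")
    # Degeneracy is the product of the codon counts: 2 * 1 * 4 * 2.
    print(len(oligos))
    print(oligos[:4])
```

The pool size grows multiplicatively with each residue, which is one reason regions rich in low-degeneracy residues (such as tryptophan and methionine) are usually favored when designing degenerate primers.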

Another embodiment includes nucleic acids which encode an HBM polypeptide or
HBM-like variant which is at least about 90% similar to SEQ ID NO: 4 and fragments
thereof, and which when administered to a subject modulate bone mass in that subject. HBM
variants may have a valine corresponding to position 171 of SEQ ID NO: 4 (Gly to Val
substitution) or 170 of the mouse homolog, or an amino acid change elsewhere in propeller 1
or in the protein which results in enhanced bone mass and/or lipid modulation. Other
preferred embodiments include high bone mass polypeptides which have at least about 10,
15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 500 or more
contiguous amino acids of SEQ ID NO: 3 or 4, with a mutation in the sequence resulting in
enhanced bone mass and/or modulated lipid levels. Such contemplated contiguous sequences
preferably overlap with a polymorphism corresponding to high bone mass, such as valine-171
of SEQ ID NO: 4, other mutations in propeller 1, or a mutation predicted from the model
provided in the Examples. Also contemplated are the polynucleotides encoding polypeptides
which are at least about 95%, 96%, 97%, 98% or 99% or more similar to SEQ ID NO: 3 or 4
and fragments thereof, wherein these polypeptides contain at least one mutation (e.g.,
valine-171 or like mutation).

In another embodiment, a synthetic nucleic acid encoding SEQ ID NO: 4 is
contemplated wherein the nucleic acid sequence has been conservatively substituted based on
the degeneracy of the code such that no amino acids are altered in SEQ ID NO: 4, but
wherein the resulting synthetic polynucleotide encoding said SEQ ID NO: 4 may be one that
is at least about 50% similar to SEQ ID NO: 2. Alternatively, SEQ ID NO: 4 may contain any
of the silent mutations identified in Section 3 below.

By a polypeptide having an amino acid sequence of at least, for example, 95%
"identity" to a reference amino acid sequence of SEQ ID NO: 3 or 4 or fragment thereof, it is
intended that the amino acid sequence of the polypeptide is identical to the reference
sequence except that the polypeptide sequence may include up to five amino acid alterations
per each 100 amino acids of the reference amino acid sequence of SEQ ID NO: 3 or 4 and has
an HBM-like phenotype. In other words, to obtain a polypeptide having an amino acid
sequence 95% identical to a reference amino acid sequence, up to 5% of the amino acid
residues in the reference sequence may be deleted or substituted with another amino acid, or a
number of amino acids up to 5% of the total amino acid residues in the reference sequence
may be inserted into the reference sequence. These alterations of the reference sequence may
occur at the amino or carboxy terminal positions of the reference amino acid sequence or
anywhere between those terminal positions, interspersed either individually among residues
in the reference sequence or in one or more contiguous groups within the reference sequence.
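The identity definition above (up to five alterations per 100 reference residues for 95% identity) can be sketched as a simple check on two pre-aligned sequences. This is an illustrative sketch only: it assumes the sequences have already been aligned with "-" marking gaps, counts every substitution, deletion, or insertion column as one alteration, and uses hypothetical function names and toy sequences.

```python
def count_alterations(ref_aln: str, query_aln: str) -> int:
    """Count alignment columns where the aligned sequences differ.

    A gap ('-') in the query is a deletion, a gap in the reference is an
    insertion, and a residue mismatch is a substitution.
    """
    if len(ref_aln) != len(query_aln):
        raise ValueError("aligned sequences must have equal length")
    return sum(1 for r, q in zip(ref_aln, query_aln) if r != q)


def meets_identity(ref_aln: str, query_aln: str, percent: int) -> bool:
    """True if the query has no more alterations than the stated identity
    allows, measured against the number of reference residues."""
    ref_len = sum(1 for r in ref_aln if r != "-")
    allowed = ref_len * (100 - percent) // 100
    return count_alterations(ref_aln, query_aln) <= allowed


if __name__ == "__main__":
    ref = "A" * 100                      # toy 100-residue reference
    print(meets_identity(ref, "A" * 95 + "V" * 5, 95))   # 5 substitutions
    print(meets_identity(ref, "A" * 94 + "V" * 6, 95))   # 6 substitutions
```

Note that the definition in the text additionally requires an HBM-like phenotype, which is a functional criterion and cannot be evaluated from sequence alone.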

Additional HBM polypeptides and nucleic acids which encode said HBM
polypeptides are contemplated wherein amino acid residues are conservatively substituted.
For example, guidance concerning how to make phenotypically silent amino acid
substitutions is provided in Bowie et al., "Deciphering the Message in Protein Sequences:
Tolerance to Amino Acid Substitutions," Science 247: 1306-10 (1990), wherein the authors
indicate that there are two main approaches for studying the tolerance of an amino acid
sequence to change. The first method relies on the process of evolution, in which mutations
are either accepted or rejected by natural selection. The second approach uses genetic
engineering to introduce amino acid changes at specific positions of a cloned gene and
selections or screens to identify sequences that maintain functionality. These studies have
revealed that proteins are surprisingly tolerant of amino acid substitutions. The authors
further indicate which amino acid changes are likely to be permissive at a certain position of
the protein. Numerous phenotypic substitutions are described in Bowie et al., supra, and the
references cited therein, which are herein incorporated by reference in their entirety.
Preferred substitutions would be in domains which are less conserved across species, and
which do not correspond to a structurally or functionally important domain (e.g., a binding
site, catalytic site, or beta propeller or other domain described in Figure 4).

Recent studies have indicated that LRP5 participates in the Wnt signal transduction
pathway. The Wnt pathway is critical in limb and early embryological development. A recently
published sketch of the components of Wnt signaling is shown in Fig. 28 (Nusse, 2001
http://www.stanford.edu/~rnusse/pathways/cell2.html) (see also, Nusse, 2001 Nature 411:
255-6; and Mao et al., 2001 Nature 411: 321-5).

Briefly summarized, Wnt proteins are secreted proteins which interact with the
transmembrane protein Frizzled (Fz). LRP proteins, such as LRP5 and LRP6, are believed to
modulate the Wnt signal in a complex with Fz (Tamai et al., 2000 Nature 407: 530-5). The
Wnt pathway acts intracellularly through the Disheveled protein (Dsh), which in turn inhibits
glycogen synthase kinase-3 (GSK3) from phosphorylating β-catenin. Phosphorylated
β-catenin is rapidly degraded following ubiquitination. However, the stabilized β-catenin
accumulates and translocates to the nucleus where it acts as a cofactor of the T-cell factor
(TCF) transcription activator complex.

The protein dickkopf-1 (Dkk-1) is reported to be an antagonist of the Wnt pathway.
Dkk-1 is required for head formation in early development (Glinka et al., 1998 Nature 391:
357-62). Dkk-1 and its function in the Wnt pathway are described in, e.g., Krupnik et al., 1999
Gene 238: 301-13; Fedi et al., 1999 J. Biol. Chem. 274: 19465-72; see also for Dkk-1 and the
Wnt pathway, Wu et al., 2000 Curr. Biol. 10: 1611-4; Shinya et al., 2000 Mech. Dev. 98: 3-
17; Mukhopadhyay et al., 2001 Dev. Cell 1: 423-434; and in PCT Patent Application No.
WO 00/52047, and in references cited in each. It has been known that Dkk-1 acts upstream
of Dsh; however, the nature of the mechanism of inhibition by Dkk-1 is just beginning to be
elucidated. Dkk-1 is expressed in the mouse embryonic limb bud and its disruption results in
abnormal limb morphogenesis, among other developmental defects (Grotewold et al., 1999
Mech. Dev. 89: 151-3; and Mukhopadhyay et al., 2001 Dev. Cell 1: 423-34).

Related U.S. Serial No. 60/291,311 (herein incorporated by reference in its entirety)
disclosed a novel interaction between Dkk-1 (GenBank Accession No. XM_005730) and
LRP5. The interaction between Dkk-1 and LRP5 was discovered by a yeast two hybrid
(Y2H) screen for proteins which interact with the ligand binding domain of LRP5. The
two-hybrid screen is a common procedure in the art, which is described, for example, by Gietz
et al., 1997 Mol. Cell Biochem. 172: 67-79; Young, 1998 Biol. Reprod. 58: 302-11; Brent et
al., 1997 Ann. Rev. Genet. 31: 663-704; and Lu et al., eds., Yeast Hybrid Technologies,
Eaton Publishing, Natick MA (2000). More recently, other studies confirm that Dkk-1 is a
binding partner for LRP and modulates the Wnt pathway via direct binding with LRP (R.
Nusse, 2001 Nature 411: 255-6; Bafico et al., 2001 Nat. Cell Biol. 3: 683-6; Semenov, 2001
Curr. Biol. 11: 951-61; Mao, 2001 Nature 411: 321-5; Zorn, 2001 Curr. Biol. 11:
R592-5; and Li et al., 2002 J. Biol. Chem. 277: 5977-81).

Mao and colleagues (2001) identified Dkk-1 as a ligand for LRP6. Mao et al. suggest
that Dkk-1 and LRP6 interact antagonistically, where Dkk proteins inhibit the Wnt coreceptor
functions of LRP6. Using co-immunoprecipitation, the group verified that the Dkk-1/LRP6
interaction was direct. Dkk-2 was also found to directly bind LRP6. Contrary to data
contained in provisional application 60/291,311, Mao et al. report that no interaction was
detected between any Dkk protein and LRP5, as well as no interaction with LDLR, VLDLR,
ApoER, or LRP. Additionally, Mao et al. demonstrated that LRP6 can titrate Dkk-1's
effects of inhibiting Wnt signaling using the commercial TCF-luciferase reporter gene assay
(TOPFLASH). A similar conclusion was drawn from analogous studies in Xenopus embryos.
Deletion analyses of LRP6 functional domains revealed that EGF repeats (beta-propellers) 3
and 4 were necessary for Dkk-1 binding and that the ligand binding domains of LRP6 had no
effect on Dkk-1 binding. The findings of Mao et al. contrast with data obtained by the
present inventors indicating that the ligand binding domains of LRP5 were necessary and
sufficient for Dkk-1 binding in yeast. Using classical biochemical ligand-receptor studies,
Mao et al. determined a Kd=0.34 nM for Dkk-1/LRP6 and a Kd=0.73 nM for Dkk-2/LRP6.

Semenov et al. (2001) verified the Mao group's results and confirmed by
coimmunoprecipitation that Dkk-1 does not directly bind to Wnt or Frizzled but rather
interacts with LRP6. Their Scatchard analyses found a Kd=0.5 nM for Dkk-1/LRP6.
Semenov et al. also demonstrated that Dkk-1 could abolish an LRP5/Frizzled8 complex,
implying that Dkk-1 can also repress Wnt signaling via interactions with LRP5. A Dkk-1
mutant where cysteine 220 was changed to alanine abolished LRP6 binding and was unable
to repress Wnt signaling. Studies in Xenopus embryos confirmed the results and revealed a
functional consequence of Dkk-1/LRP6: repression of Wnt signaling. Their Xenopus work
also suggested that LRP6/Dkk-1 may be specific for the canonical, β-catenin-mediated, Wnt
pathways as opposed to the Wnt Planar Cell Polarity pathway.

Bafico et al. (2001) employed a 125I-labeled Dkk-1 molecule to identify LRP6 as its
sole membrane receptor with a Kd=0.39 nM. Again, the functional consequence of the
Dkk-1/LRP6 interaction was a repression of canonical Wnt signaling, even when Dkk-1 was
added at extremely low concentrations (30 pM).

Not wishing to be bound by theory, it is believed that the present invention provides
an explanation for the mechanism of Dkk-1 inhibition of the Wnt pathway and provides a
mechanism whereby the Wnt pathway may be modulated. The present application and
related U.S. Serial No. 60/291,311 (which is herein incorporated by reference in its entirety)
describe Dkk-1/LRP5 interactions and demonstrate that the interaction between
LRP5/LRP6/HBM and Dkk can be used in a method as an intervention point in the Wnt
pathway for an anabolic bone therapeutic or a modulator of lipid metabolism.

As detailed in the Examples, Dkk-1 is able to repress LRP5-mediated Wnt signaling
but not HBM-mediated Wnt signaling. This observation is of particular interest because the
HBM mutation in LRP5 is a gain of function or activation mutation. That is, Wnt signaling,
via the canonical pathway, is enhanced with HBM versus LRP5. Thus, other HBM-like
mutations would also function similarly. The present data suggest the mechanism of this
functional activation: the inability of Dkk-1 to repress HBM-mediated Wnt signaling.
Further investigations of other Wnt or Dkk family members show differential activities in the
canonical Wnt pathway that demonstrate the complexity and variability in Wnt signaling that
can be achieved depending on the LRP/Dkk/Wnt/Frizzled repertoire that is expressed in a
particular cell or tissue. This may account for the apparent bone specificity of the HBM or
HBM-like phenotypes in humans and in the HBM transgenic animals.

Furthermore, the present data reveal the importance and functional consequence of
the potential structural perturbation of the first beta-propeller domain of LRP5. Our data
identified the ligand binding domain of LRP5 as the interacting region with Dkk-1, while the
Mao et al. publication demonstrated the functional role of propellers 3 and 4 in their
LRP6/Dkk-1 studies. In the present invention, we implicate the first beta propeller domain,
via the HBM mutation at residue 171, as having a functional consequence in the
Dkk-1-mediated Wnt pathway. The involvement of position 171 of propeller 1 with Dkk-1
may be direct or indirect. Direct involvement could arise from perturbations of the
3-dimensional structure of the HBM extracellular domain that render Dkk-1 unable to bind.
Alternatively, residue 171 of propeller 1 may directly interact with Dkk-1; however, by itself,
it is insufficient to bind and requires other LRP5 domains. Potential indirect candidate
molecules may be among the proteins identified in the Dkk-1 yeast two-hybrid experiments.

It may be that the disruption of Dkk activity is not necessarily mediated by enhancing
or preventing the binding of Dkk to LRP5/LRP6/HBM. More than one mechanism may be
involved. Indeed, the inventors have observed that Dkk-1 binds LRP5, LRP6, and HBM. It
is able to effectively inhibit LRP6 and, to a slightly lesser extent, LRP5 activity. Further, it
has been observed that different members of the Dkk family differentially affect
LRP5/LRP6/HBM activity. For example, Dkk-1 inhibits LRP5/LRP6/HBM activity while
another Dkk may enhance LRP5/LRP6/HBM activity. An endpoint to consider is the
modulation of the LRP5/LRP6/HBM activity, not simply binding.

The present disclosure shows that targeting the modulation of the Dkk-1/LRP5
interaction is a therapeutic intervention point for an HBM or HBM-like mimetic agent. A
therapeutic agent of the invention may be a small molecule, peptide or nucleic acid aptamer,
antibody, or other peptide/protein, etc. Methods of reducing Dkk-1 expression may also be
therapeutic using methodologies such as: RNA interference (i.e., siRNAs), small hairpin
RNAs (shRNAs), antisense oligonucleotides, morpholino oligonucleotides, PNAs, antibodies
to Dkk-1 or Dkk-1 interacting proteins, decoy or scavenger LRP5 or LRP6 receptors, and
knockdown of Dkk-1 or Dkk-1 interactor transcription. For discussion of small hairpin
RNAs, see Yu et al., 2002 Proc. Natl. Acad. Sci. USA 99: 6047-52; Tuschl, 2002 Nature
Biotech. 20: 446-8; Lee et al., 2002 Nature Biotech. 19: 500-5; Paddison et al., 2002 Genes
& Devel. 16: 948-58; and Brummelkamp et al., 2002 Science 296: 550-3.

In an embodiment of the present invention, the activity of Dkk-1 or the activity of a
Dkk-1 interacting protein may be modulated, for example, by binding with a peptide aptamer
of the present invention. In another embodiment, LRP5 activity may be modulated by a
reagent provided by the present invention (e.g., a peptide aptamer). In another embodiment,
the Dkk-1/LRP5 interaction may be modulated by a reagent of the present invention (e.g., a
Dkk-1 interacting protein such as those identified in Fig. 31). In another embodiment, the
Wnt signal transduction pathway may be modulated by use of one or more of the above
methods. In a preferred embodiment of the present invention, the Dkk-1 mediated activity of
the Wnt pathway may be specifically modulated by one or more of the above methods. In
another preferred embodiment of the present invention, the Wnt signal transduction pathway
may be stimulated by down-regulating Dkk-1 interacting protein activity; such
down-regulation could, for example, yield greater LRP5 activity. In a more preferred
embodiment, by stimulating LRP5 activity, bone mass regulation may be stimulated to restore
or maintain a more optimal level. In another preferred embodiment, by stimulating LRP5
activity, lipid metabolism may be stimulated to restore or maintain a more optimal level.
Alternative embodiments provide methods for screening candidate drugs and therapies
directed to correction of bone mass disorders or lipid metabolism disorders. Preferred
embodiments of the present invention also provide drugs and therapies developed by the use
of the reagents and/or methods of the present invention. One skilled in the art will understand
that the present invention provides important research tools to develop an effective model of
osteoporosis, to increase understanding of bone mass and lipid modulation, and to modulate
bone mass and lipid metabolism. For a more detailed description of Dkk-1 and Dkk-1
interacting protein modulation, please refer to U.S. Serial No. 60/361,293, which is herein
incorporated by reference in its entirety.

3. Alternative Variants of LRP5/LRP6 Having HBM Activity 

A structural model of the LRP5/Zmax1 first beta-propeller module was generated
based on a model prediction in Springer et al., 1998 J. Mol. Biol. 283: 837-862. Based on the
model, certain amino acid residues were identified as important variants of
LRP5/HBM/Zmax1. The model and modifications thereof are discussed in more detail in the
Examples. The following three categories provide examples of such variants:

The shape of the beta-propeller resembles a disk with inward-sloping sides and a hole
down the middle. Residue 171 is in a loop on the outer or top surface of the domain in blade
4 of propeller module 1. Thus, variants comprising changed residues in structurally
equivalent positions in other blades, as well as residues that are slightly more interior to the
binding pocket but still accessible to the surface, are important embodiments of the present
invention for the study of bone mass modulation by LRP5/HBM, for the development of
pharmaceuticals and treatments of bone mass disorders, and for other objectives of the
present invention. The following Table contains examples of such variants:
TABLE 1

Variant   Effect of Mutation

A214V     a position equivalent to 171 in blade 5 of propeller 1; alanine is not conserved
          in other propellers
E128V     a position equivalent to 171 in blade 3 of propeller 1; glutamate is not
          conserved in other propellers
A65V      a position equivalent to 171 in blade 2 of propeller 1; alanine is conserved in
          propellers 1-3 but not 4
G199V     an accessible interior position in blade 5 of propeller 1; glycine is conserved in
          propellers 1-3 but not 4
M282V     an accessible interior position in blade 1 of propeller 1; methionine is conserved
          in propellers 1-3 but not 4

These mutations were further analyzed based on a more sophisticated model, as discussed
and described in Example 11 below.

LRP5/Zmax1 has four beta-propeller structures; the first three beta-propeller modules
conserve a glycine in the position corresponding to residue 171 in human LRP5/Zmax1.
Therefore, variants bearing a valine in the equivalent positions in the other propellers are
important embodiments of the present invention. The following variants are useful for the
study of bone mass modulation by LRP5/HBM, for the development of pharmaceuticals and
treatments of bone mass disorders, and for other objectives of the present invention: G479V,
G781V, and Q1087V of SEQ ID NO: 3, which demonstrate that propeller 1 is an important
determinant of an HBM or HBM-like effect.

The G171V HBM polymorphism (SEQ ID NO: 4) results in "occupied space" of the
beta-propeller 1, with the side-chain from the valine residue sticking out into an open binding
pocket and potentially altering a ligand/protein interaction. The glycine residue is conserved
in LRP5/Zmax1 propellers 1, 2 and 3 but is a glutamine in propeller 4. Therefore, the
following variants of LRP5/HBM are important embodiments of the present invention for the
study of bone mass modulation by LRP5/HBM, for the development of pharmaceuticals and
treatments of bone mass disorders, and for other objectives of the present invention:

G171K: introduces a charged side-chain
G171F: introduces a ringed side-chain
G171I: introduces a branched side-chain
G171Q: introduces the propeller 4 residue

These substitutions, along with substitutions in other regions of propeller 1 of SEQ ID NO: 3
(i.e., A214V and M282V), have been shown to produce an HBM-like effect by TCF assay
(Figs. 29 and 30). Thus these substitutions in other propeller 1 domains would similarly have
an expectation of producing an HBM-like phenotype readily assayable by TCF assay.

Furthermore, LRP6 is the closest homolog of LRP5/Zmax1. LRP6 has a
beta-propeller structure predicted to be similar, if not identical, to that of Zmax1. The position
corresponding to glycine 171 of human LRP5/Zmax1 is glycine 158 of human LRP6. Thus,
corresponding variants of LRP6 are an important embodiment of the present invention for the
study of the specificity of LRP5/Zmax1 versus its related family member, for the
development of pharmaceuticals and treatments of bone mass disorders, and for other
objectives of the present invention. Specifically, for example, a glycine to valine substitution
at the structurally equivalent position, residue 158, of human LRP6 and similar variants of
other species' LRP6 homologs represent important research tools.

Site-directed mutants of LRP5 were generated in the full-length human LRP5 cDNA
using the QuikChange XL Site-Directed Mutagenesis Kit (catalog #200516, Stratagene, La
Jolla, CA) following the manufacturer's protocol. The mutant sequences were introduced
using complementary synthetic oligonucleotides:

Mutation        Complementary oligos

A65V            5'-TGGTCAGCGGCCTGGAGGATGTGGCCGCAGTGGACTTCC-3'
                5'-GGAAGTCCACTGCGGCCACATCCTCCAGGCCGCTGACCA-3'

E128V           5'-AAGCTGTACTGGACGGACTCAGTGACCAACCGCATCGAGG-3'
                5'-CCTCGATGCGGTTGGTCACTGAGTCCGTCCAGTACAGCTT-3'

G171K           5'-ATGTACTGGACAGACTGGAAGGAGACGCCCCGGATTGAGCG-3'
                5'-CGCTCAATCCGGGGCGTCTCCTTCCAGTCTGTCCAGTACAT-3'

G171F           5'-ATGTACTGGACAGACTGGTTTGAGACGCCCCGGATTGAGCG-3'
                5'-CGCTCAATCCGGGGCGTCTCAAACCAGTCTGTCCAGTACAT-3'

G171I           5'-ATGTACTGGACAGACTGGATTGAGACGCCCCGGATTGAGCG-3'
                5'-CGCTCAATCCGGGGCGTCTCAATCCAGTCTGTCCAGTACAT-3'

G171Q           5'-ATGTACTGGACAGACTGGCAGGAGACGCCCCGGATTGAGCG-3'
                5'-CGCTCAATCCGGGGCGTCTCCTGCCAGTCTGTCCAGTACAT-3'

G199V           5'-CGGACATTTACTGGCCCAATGTACTGACCATCGACCTGGAGG-3'
                5'-CCTCCAGGTCGATGGTCAGTACATTGGGCCAGTAAATGTCCG-3'

A214V           5'-AGCTCTACTGGGCTGACGTCAAGCTCAGCTTCATCCACCG-3'
                5'-CGGTGGATGAAGCTGAGCTTGACGTCAGCCCAGTAGAGCT-3'

M282V           5'-GAGTGCCCTCTACTCACCCGTGGACATCCAGGTGCTGAGCC-3'
                5'-GGCTCAGCACCTGGATGTCCACGGGTGAGTAGAGGGCACTC-3'

G479V           5'-CATGTACTGGACAGACTGGGTAGAGAACCCTAAAATCGAGTGTGC-3'
                5'-GCACACTCGATTTTAGGGTTCTCTACCCAGTCTGTCCAGTACATG-3'

G781V           5'-CATCTACTGGACCGAGTGGGTCGGCAAGCCGAGGATCGTGCG-3'
                5'-CGCACGATCCTCGGCTTGCCGACCCACTCGGTCCAGTAGATG-3'

Q1087V          5'-GTACTTCACCAACATGGTGGACCGGGCAGCCAAGATCGAACG-3'
                5'-CGTTCGATCTTGGCTGCCCGGTCCACCATGTTGGTGAAGTAC-3'

G158V of LRP6   5'-GTACTGGACAGACTGGGTAGAAGTGCCAAAGATAGAACGTGC-3'
                5'-GCACGTTCTATCTTTGGCACTTCTACCCAGTCTGTCCAGTAC-3'
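Because each mutation is introduced with a pair of complementary oligonucleotides, a simple consistency check is that the second oligo of each pair is the exact reverse complement of the first. The sketch below illustrates that check using the G479V pair from the list above; the function names are hypothetical, and the check is offered only as an illustration, not as part of the mutagenesis protocol.

```python
# Translation table for Watson-Crick complements of unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_complementary_pair(sense: str, antisense: str) -> bool:
    """True if the two oligos (both written 5' to 3') anneal end to end."""
    return reverse_complement(sense) == antisense.upper()


if __name__ == "__main__":
    # G479V oligo pair as listed in the text.
    sense = "CATGTACTGGACAGACTGGGTAGAGAACCCTAAAATCGAGTGTGC"
    antisense = "GCACACTCGATTTTAGGGTTCTCTACCCAGTCTGTCCAGTACATG"
    print(is_complementary_pair(sense, antisense))
```

The same check can be run over every pair before ordering oligonucleotides, since a single transcription error in either strand would make the pair fail.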



All constructs were sequence verified to ensure that only the engineered modification
was present in the gene. Once verified, each variant was functionally evaluated in the
TCF-luciferase assay in U2OS cells (essentially as described in Example 6). Other functional
evaluations could also be performed, such as the Xenopus embryo assay (essentially as
described in Example 5), or other assays to evaluate Wnt signaling, Dkk modulation, or
anabolic bone effect. Binding of these mutants to Dkk, LRP-interacting proteins,
Dkk-interacting proteins, or peptide aptamers to any of the preceding could also be
investigated in a variety of ways, such as in a two-hybrid system (such as in yeast as described
in this application), or other methods.

Figure 29 shows the effects of the G171F mutation in propeller 1 of LRP5. This
mutation is at the same position as HBM's G171V substitution. Expression of G171F results
in an HBM effect. That is, in the presence of Wnt, G171F is able to activate the
TCF-luciferase reporter construct. In fact, it may activate the reporter to a greater extent than
either LRP5 or HBM. Furthermore, in the presence of Dkk1 and Wnt1, G171F is less
susceptible than LRP5 to modulation by Dkk. These data exemplify that the G171F variant
modulates Wnt signaling in a manner similar to HBM. In addition, these data confirm that
HBM's valine residue at 171 is not the only modification at 171 that can result in an HBM
effect. Together these data support an important role for LRP5 propeller 1 in modulating
Wnt pathway activity, in responding to Dkk modulation, and in the ability to generate an
HBM effect.

Figure 30 shows the effects of the M282V mutation in propeller 1 of LRP5. M282V
expression results in an HBM effect. That is, in the presence of Wnt, M282V is able to
activate the TCF-luciferase reporter construct. Furthermore, in the presence of Dkk1 and
Wnt1, M282V is less susceptible than LRP5 to modulation by Dkk. These data show that the
M282V variant modulates Wnt signaling in a manner similar to HBM. In addition, these data
confirm that modifications of other residues in propeller 1 of LRP5 can result in an HBM
effect.

These data support an "occupied space" model of the HBM mutation in propeller 1 
and show that multiple mutations of propeller 1 are capable of generating an HBM effect; the 
original G171V HBM mutation is not unique in this ability. Moreover, various perturbations 
in propeller 1 can modulate Dkk activity. 

These data illustrate the molecular mechanism of Dkk modulation of LRP signaling.
Using the methods disclosed herein and in U.S. Application 60/290,071, generation of a
comprehensive mutant panel will reveal residues in LRP that function in Dkk modulation of
Wnt signaling. Such variants of LRP5 and LRP6 that modulate Dkk activity, and the
residues which distinguish them from LRP5 and LRP6, are points for therapeutic intervention
by small molecule compound, antibody, peptide aptamer, or other agents. Furthermore,
models of each HBM-effect mutation/polymorphism may be used in rational drug design of
an HBM mimetic agent.

These and the examples provided infra are only a few illustrative examples presented
to better describe the present invention. Variants of LRP5 which have demonstrated HBM
activity in assays include A65V, G171I, G171V (HBM), G171F, M282V, G171K, G171Q
and A214V. Clearly, other variants may be contemplated within the scope of the present
invention. Furthermore, wherever HBM is recited in the methods of the invention, it should
be understood that any such alternative variant of LRP5 or LRP6 which demonstrates
HBM-like biological activity is also encompassed by those claims.

Additional mutations may also result in conformational changes such as those
described above and in the Examples below. These mutations may also result in an HBM-like
phenotype when expressed. The mutations listed in Table 2 have been identified in the exons
of Zmax1 (LRP5) based on familial genetics.

TABLE 2: Mutations in the Exons of LRP5

Exon          Location         Reference        Change        AA              AA Change
              (ATG as nt 1)                                   (Met as AA1)

[illegible]   ?49              CGAGGAGGCCATC    C/T           [illegible]     None
[illegible]   [illegible]      [illegible]      [illegible]   [illegible]     [illegible]
[illegible]   512              GTGAGACGCC       G/T           G171            G→V
[illegible]   1199             CGTACCTGGACGGG   C/T           A400            A→V
[illegible]   1647             CGGGTTCACGCTGC   C/T           [illegible]     None
[illegible]   [illegible]      GGCCTTCTTGGTCT   G/A           F644            None
9             1999             GTGGCCATCCCGCT   G/A           V667            V→M
10            2220             CAGAATCGAAGTG    C/T           N740            None
14            3107             GGACACTGTT       G/A           R1036           R→Q
15            3297             CGGCACCGAGCG     C/T           D1099           None
15            3357             AGACAACACACTG    A/G           V1119           None
16            3564             GACTCGCATC       G/A           R1188           None
18            3989             CGGACTGTGA       C/T           A1330           A→V
20            4137             CAGCCCGGCCCAC    C/T           D1379           None
20            4248             GGGGGCCAACGG     G/A           A1416           None
20            4565             CGGCCACTGC       C/T           P1522           P→L
23            4635             CGACGTGTGTGACA   C/T           T1545           None




By "C/T", for example, is meant that there is a C to T mutation.
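The Location and AA columns of Table 2 are related by simple arithmetic when the A of the ATG start codon is counted as nucleotide 1 and the initiating methionine as amino acid 1. The sketch below illustrates the conversion; the function name is hypothetical and the example positions are taken from legible rows of Table 2.

```python
def codon_of(nt_position: int) -> tuple:
    """Map a coding-sequence nucleotide position (ATG 'A' = 1) to
    (amino acid number, position within the codon), both 1-based."""
    if nt_position < 1:
        raise ValueError("positions are 1-based")
    codon = (nt_position - 1) // 3 + 1
    position_in_codon = (nt_position - 1) % 3 + 1
    return codon, position_in_codon


if __name__ == "__main__":
    # Nucleotide positions from Table 2 and the residues listed for them.
    for nt, residue in [(512, "G171"), (1999, "V667"), (2220, "N740"), (4565, "P1522")]:
        print(nt, residue, codon_of(nt))
```

Changes at the third codon position, such as nucleotide 2220, are frequently silent, which is consistent with the "None" entries in the AA Change column.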

Additional mutations, which may result in splice variants, have also been identified in the
introns and are provided in Table 3 below.
TABLE 3: Mutations in the Introns of LRP5

Exon   Nucleotide Position    Sequence with SNP      Nucleotide
       with respect to Exon   Location Underlined

2      +53                    CTCTCTCTCGAATT         C/T
5      -4                     TCAGTCCACACTCG         T/C
5      +8                     GGGCGGGGGCTGGG         G/A
7      -50                    GCAGAGACCAGAC          G/A
8      -118                   TGCTCTTGGGCATT         T/G
9      -131                   TGGGGGTGAGTCCT         T/C
10     +6                     TGTTTGCCTGTCCC         T/C
11     -173                   ATGTGTGTGGCAG          A/G
11     -152                   GGTCTCGCCCTTC          G/A
11     -49                    CGGTGAGAGCAGAC         C/T
11     +37                    CGGGGCAGCCGGG          C/T
11     +78                    GTACCCTGTGGCCT         G/A
12     +80                    CTCATCTGGGGTTC         C/G
12     +141                   ATGATGCTACCTGG         A/G
15     -166                   CGGGAATTTGGAGA         C/T
15     -149                   TTGTTCAACTAGTA         T/C
15     -52                    TCCGAGGAGACGC          T/G
17     -213                   TTGTTTCCGGCATC         T/C
17     -82                    CATTTGGCCCCTA          C/T
18     -72                    GCCCAGTCAC             G/A
18     -63                    CGCCATTGCC             C/T
18     -30                    GTGTGATGTT             G/A
18     +23                    TGATCTGGAGGAGG         T/C
18     +47                    GTCTGGGCAGCTTT         G/C
18     +54                    CACCGTCAGTGCT          C/A
22     -118                   GGCACCTGCC             G/A



These splice variants may also produce an altered phenotype capable of conferring an
HBM-like effect.

These mutations were identified using 80 ng of genomic DNA, which was PCR
amplified with the primers indicated in Table 4 below with M13F
(TGTAAAACGACGGCCAGT) attached to the 5' end of the Forward primer or M13R
(AGGAAACAGCTATGACCAT) attached to the 5' end of the Reverse primer. 4 µl of the
PCR reaction was diluted to a final volume of 100 µl with water. Sequencing was performed
using ET and M13F and M13R primers or by the ABI-PRISM® Big-Dye™ method of
Applied Biosystems using the indicated nested sequencing primers. The sequences were
assembled in Consed with the appropriate reference sequence.
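Attaching the M13 tails to gene-specific primers, as described above, amounts to prepending the M13F or M13R sequence to the 5' end of the forward or reverse primer. The sketch below illustrates this. The M13 sequences are those given in the text; the forward primer shown is a hypothetical placeholder, the reverse primer is one of the legible R-primer entries of Table 4, and the function name is an assumption.

```python
M13F = "TGTAAAACGACGGCCAGT"   # universal forward tail, as given in the text
M13R = "AGGAAACAGCTATGACCAT"  # universal reverse tail, as given in the text


def add_m13_tails(forward: str, reverse: str) -> tuple:
    """Prepend the M13F tail to the forward primer and the M13R tail to the
    reverse primer (all sequences written 5' to 3')."""
    return M13F + forward.upper(), M13R + reverse.upper()


if __name__ == "__main__":
    forward_primer = "ACGTACGTACGTACGTAC"   # hypothetical gene-specific primer
    reverse_primer = "TCGCCAAGTCGCTTCCG"    # legible R-primer entry from Table 4
    tailed_f, tailed_r = add_m13_tails(forward_primer, reverse_primer)
    print(tailed_f)
    print(tailed_r)
```

Tailing every amplicon this way lets a single pair of universal M13 primers be used for sequencing regardless of which exon was amplified.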
The PCR mixes used were as follows:

Promega: 50 mM KCl, 10 mM Tris-HCl, pH 9.0, 0.1% Triton X-100, 1.5 mM MgCl2
Invitrogen D: 60 mM Tris-HCl, pH 8.5, 15 mM (NH4)2SO4, 3.5 mM MgCl2
Invitrogen J: 60 mM Tris-HCl, pH 9.5, 15 mM (NH4)2SO4, 2.0 mM MgCl2
Invitrogen M: 60 mM Tris-HCl, pH 10.0, 15 mM (NH4)2SO4, 1.5 mM MgCl2

To all the PCR mixes, 120 µM of all 4 dNTPs was added, together with 0.4 µM of the forward
and reverse primers, 80 ng genomic DNA, 1 U of AmpliTaq® DNA polymerase, and 1.1 U of
TaqStart antibody (Clontech). The PCR reaction was then run as follows: 94°C, 2 min; (94°C,
30 sec; X-anneal-temp, 30 sec; 72°C, 2 min) for 35 cycles; and 72°C for 3 min.
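
As a rough check on the cycling program above, the programmed hold time can be totalled. The step durations are those stated in the text. The function name is hypothetical, and ramping time between temperatures is not included because the text does not give it.

```python
def program_seconds(initial, cycle_steps, n_cycles, final):
    """Total programmed hold time in seconds (ramping between steps excluded)."""
    return initial + n_cycles * sum(cycle_steps) + final


# 94 C 2 min; 35 x (94 C 30 s, anneal 30 s, 72 C 2 min); 72 C 3 min
total = program_seconds(initial=120, cycle_steps=(30, 30, 120), n_cycles=35, final=180)
print(total / 60)  # 110.0 minutes of hold time
```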
TABLE 4

[PCR and sequencing primers by exon. The table was printed sideways; in this copy the
row alignment and the Exon, Prod. Size and annealing-temperature columns are not
recoverable. Legible entries are listed by column, reproduced as scanned; some characters
are uncertain.]

F-primer(PCR):
GAGACGCGGCGCGGCTTC
CCAAGTTCTGAGAAGTCC
GGCGTAGTGGTGGGCATCAG
GGGAMTTGCAGGCCGTCTG
CACCTMCATCACCAGCC
ACACCGACATTTACGAGCAC
TGTGTGGCGGGMTAMG
ATGTCTACAAMCACGCTG
GAAMCCAGCAAMGCCC
ACAGCGATTATATCTACTGGAC
AGMTGAMTTCCCCATAGCG
CAGTTGGATTTAGGGCCTACC
MCTGATGGCCTTCATCCC
AGACTGATGGTATGGGCACAG
TTGGAGGAGGTACCATGTGTC

R-primer(PCR):
CGCCCCAACTCGCTCCCAAC
MTACCTGAAACCATACCTG
CTGAGGACCMGGCGGAGAG
GATGCMGACAGTGTCCC
AMTAGCAGAGCACAGGCAC
TCCTTGGCCAGATACTGTCAC
TTGAGGCAGGMCAGAGG
CTMTCACTGAGGGCCAC
TGTCCATCACGMGTCCAG
TCTTAAMCGCTGTTCGTGGT
ATTTGTGAGATGCAGGAMCG
GCAGGMGGTGAGATAGACCC
TTGCCTTTCACTGAGATGAC

PCR buffer (legible entries): Promega for most amplicons; Promega + 10% DMSO,
Invitrogen D, and Invitrogen J for one amplicon each.

F-primer(seq):
AGGCTGAGGGCCCCATG
CCTGGCTGAGTATTTCC
CGACATTTACTGACACCA
ACAMCAMCAMGCGTCA
CTGTGCACATTGGAGCT
ACTGTGGGMTTCAGGG
ATGAGGGCGGCCATGTG
AGACTGGMTCTGCACAG
TCCCGTGGACCTCCAGC
GAGCCCAGCCCAGGTGG
TAGAGACTGGTGCAGAC
GGCGTAGACCTCCCCAC
TCCCGTTTCACAGATGAG
MCATGCAGTGCCCGCT
TGGCCAGTGGACAGGCC

R-primer(seq):
TCGCCAAGTCGCTTCCG
TTGGTTATTTCCGATGGG
ATTGCAGCAGGTACCCC
CAGGATTGACCTCCTGG
TCMTCTCCCTCTCGCC
CCATCGGTGCCTCGCCA
ACGGACAGCCTGCCACC
TGCAGCAAAGGCACCCA
GCGGTTCCGGCCGCTAG
TGTGCTGGCAGTATGAG
ACMCGGGGAMCCCAG
MCACGCTACACACAMG
GATTTGTTCTGCGGCAAMG
CAGTACTTAGAGGAAMTTC
TCACAGTCTGCCTTCMG
CCGCTGCCCTGGGAMG
CTGATGCCMGCACAGG
CAGATTATCCACMTCMC
ACACMCTCAMTGCACAC

4. Genotyping of Microsatellite Markers

To narrow the genetic interval to a region smaller than that originally reported by
Johnson et al., 1997 Am. J. Hum. Genet., 60:1326-1332, additional microsatellite markers on
chromosome 11q12-13 were typed. The new markers included: D11S4191, D11S1883,
D11S1785, D11S4113, D11S4136, D11S4139 (Dib et al., 1996 Nature, 380:152-154), FGF3
(Polymeropoulos et al., 1990 Nucl. Acids Res., 18: 7468), as well as GTC_HBM_Marker_1,
GTC_HBM_Marker_2, GTC_HBM_Marker_3, GTC_HBM_Marker_4,
GTC_HBM_Marker_5, GTC_HBM_Marker_6, and GTC_HBM_Marker_7 (See Fig. 2).

Blood (20 ml) was drawn into lavender cap (EDTA containing) tubes by a certified 
phlebotomist. The blood was stored refrigerated until DNA extraction. DNA has been 
extracted from blood stored for up to 7 days in the refrigerator without reduction in the 
quality or quantity of yield. For those subjects that have blood drawn at distant sites, a 
shipping protocol was successfully used on more than a dozen occasions. Blood samples 
were shipped by overnight express in a styrofoam container with freezer packs to provide 
cooling. Lavender cap tubes were placed in individual plastic shipping tubes and then into
"zip-lock" biohazard bags. When the samples arrived the next day, they were immediately 
processed to extract DNA. 

The DNA extraction procedure used a kit purchased from Gentra Systems, Inc. 
(Minneapolis, Minnesota). Briefly, the procedure involved adding 3 volumes of a red blood 
cell lysis buffer to the whole blood. After incubation for 10 minutes at room temperature,
the solution was centrifuged in a Beckman tabletop centrifuge at 2,000 X g for 10 minutes.
The white blood cell pellet was resuspended in Cell Lysis Buffer. Once the pellet was
completely resuspended and free of cell clumps, the solution was digested with RNase A for 
15 minutes at 37 °C. Proteins were precipitated by addition of the provided Protein 
Precipitation Solution and removed by centrifugation. The DNA was precipitated out of the 
supernatant by addition of isopropanol. This method was simple and fast, requiring only 1-2 
hours, and allowed for the processing of dozens of samples simultaneously. The yield of 
DNA was routinely >8 mg for a 20 ml sample of whole blood and had a MW of >50 kb. 
DNA was archived by storing coded 50 µg aliquots at -80°C as an ethanol precipitate.

DNA was genotyped using one fluorescently labeled oligonucleotide primer and one
unlabeled oligonucleotide primer. Labeled and unlabeled oligonucleotides were obtained
from Integrated DNA Technologies, Inc. (Coralville, Iowa). All other reagents for
microsatellite genotyping were purchased from Perkin Elmer-Applied Biosystems, Inc.
("PE-ABI") (Norwalk, Connecticut). Individual PCR reactions were performed for each marker,
as described by PE-ABI, using AmpliTaq™ DNA Polymerase. The reactions were added to 3.5 µl
of loading buffer containing deionized formamide, blue dextran and TAMRA 350 size
standards (PE-ABI). After heating at 95°C for 5 minutes to denature the DNA, the samples
were loaded and electrophoresed as described in the operator's manual for the Model 377
DNA Sequencer (PE-ABI, Foster City, California). After gel electrophoresis, the data were
analyzed using PE-ABI GENESCAN™ and GENOTYPER™ software. First, within the
GENESCAN™ software, the lane tracking was manually optimized prior to the first step of
analysis. After the gel lane data were extracted, the standard curve profiles of each lane were
examined and verified for linearity and size calling. Lanes that had problems with either
of these parameters were re-tracked and verified. Once all lanes were tracked and the size
standards were correctly identified, the data were imported into GENOTYPER™ for allele
identification. To expedite allele calling (binning), the program Linkage Designer from the
Internet web-site of Dr. Guy Van Camp (http://alt.www.uia.ac.be/u/dnalab/ld.html) was used.
This program greatly facilitates the importing of data generated by GENOTYPER™ into the
pedigree drawing program Cyrillic (Version 2.0, Cherwell Scientific Publishing Limited,
Oxford, Great Britain) and subsequent linkage analysis using the program LINKAGE
(Lathrop et al., 1985 Am. J. Hum. Genet. 37: 482-98).
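
Allele calling (binning) groups the fractional fragment sizes reported for a microsatellite into discrete alleles. The sketch below shows one simple way to do this and is not the algorithm used by Linkage Designer or GENOTYPER™. The function name, the 0.5 bp tolerance and the fragment sizes are all invented for illustration.

```python
def bin_alleles(sizes, tolerance=0.5):
    """Group fragment sizes (bp) into allele bins.

    Sizes are sorted; a gap larger than `tolerance` (an assumed value) starts a
    new bin. Each bin is labelled by its mean size rounded to the nearest bp.
    """
    bins = []
    for size in sorted(sizes):
        if bins and size - bins[-1][-1] <= tolerance:
            bins[-1].append(size)
        else:
            bins.append([size])
    return {round(sum(b) / len(b)): b for b in bins}


calls = [151.8, 152.1, 155.9, 156.2, 156.0, 160.1]  # invented size calls
alleles = bin_alleles(calls)
print(sorted(alleles))
```

A real binning step would also have to handle stutter peaks and bins whose rounded labels collide, both of which this sketch ignores.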

5. Linkage Analysis

Fig. 1 demonstrates the pedigree of the individuals used in the genetic linkage studies
for this invention. Specifically, two-point linkage analysis was performed using the MLINK
and LINKMAP components of the program LINKAGE (Lathrop et al., 1985 Am. J. Hum.
Genet., 37: 482-98). Pedigree/marker data were exported from Cyrillic as a pre-file into the
Makeped program and converted into a suitable ped-file for linkage analysis.

The original linkage analysis was performed using three models: (i) an autosomal
dominant, fully penetrant model, (ii) an autosomal dominant model with reduced penetrance,
and (iii) a quantitative trait model. The HBM locus was mapped to chromosome 11q12-13
by analyzing DNA for linked markers from 22 members of a large, extended kindred. A
highly automated technology was used with a panel of 345 fluorescent markers which
spanned the 22 autosomes at a spacing interval ranging from 6-22 cM. Only markers from
this region of chromosome 11 showed evidence of linkage (LOD score ~3.0). The highest
LOD score (5.74) obtained by two-point and multipoint analysis was at D11S987 (map position
55 in Fig. 2). The 95% confidence interval placed the HBM locus between markers D11S905
and D11S937 (map position 41-71 in Fig. 2). Haplotype analysis also places the Zmax1
(LRP5) gene in this same region. Further descriptions of the markers D11S987, D11S905,
and D11S937 can be found in Gyapay et al., 1994 Nature Genetics, Vol. 7.

In this invention, the inventors report the narrowing of the HBM interval to the region
between markers D11S987 and GTC_HBM_Marker_5. These two markers lie between the
delimiting markers from the original analysis (D11S905 and D11S937) and are
approximately 3 cM from one another. The narrowing of the interval was accomplished
using genotypic data from the markers D11S4191, D11S1883, D11S1785, D11S4113,
D11S4136, D11S4139 (Dib et al., 1996 Nature 380: 152-4), FGF3 (Polymeropoulos et al.,
1990 Nucl. Acids Res., 18: 7468) (information about the genetic markers can be found at the
Internet site of the Genome Database, http://gdbwww.gdb.org/), as well as the markers
GTC_HBM_Marker_1, GTC_HBM_Marker_2, GTC_HBM_Marker_3,
GTC_HBM_Marker_4, GTC_HBM_Marker_5, GTC_HBM_Marker_6, and
GTC_HBM_Marker_7.

As shown in Fig. 1, haplotype analysis with the above genetic markers identifies
recombination events (crossovers) in individuals 9019 and 9020 that significantly refine the
interval of chromosome 11 to which the Zmax1 (LRP5) gene is localized. Individual 9019 is
an HBM-affected individual who inherits a portion of chromosome 11 from the maternal
chromosome with the HBM gene, and a portion from the chromosome 11 homologue. The
portion inherited from the HBM gene-carrying chromosome includes markers D11S935,
D11S1313, GTC_HBM_Marker_4, D11S987, D11S1296, GTC_HBM_Marker_6,
GTC_HBM_Marker_2, D11S970, GTC_HBM_Marker_3, D11S4113,
GTC_HBM_Marker_1, GTC_HBM_Marker_7 and GTC_HBM_Marker_5. The portion
from D11S4136 and continuing in the telomeric direction is derived from the non-HBM
chromosome. These data place the Zmax1 (LRP5) gene in a location centromeric to the
marker GTC_HBM_Marker_5. Individual 9020 is an unaffected individual who also exhibits
a critical recombination event. This individual inherits a recombinant paternal chromosome
11 that includes markers D11S935, D11S1313, GTC_HBM_Marker_4, D11S987, D11S1296
and GTC_HBM_Marker_6 from her father's (individual 0115) chromosome 11 homologue
that carries the HBM gene, and markers GTC_HBM_Marker_2, D11S970,
GTC_HBM_Marker_3, GTC_HBM_Marker_1, GTC_HBM_Marker_7,
GTC_HBM_Marker_5, D11S4136, D11S4139, D11S1314, and D11S937 from her father's
chromosome 11 that does not carry the HBM gene. Marker D11S4113 is uninformative due
to its homozygous nature in individual 0115. This recombination event places the
centromeric boundary of the HBM region between markers D11S1296 and D11S987.
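
The logic of locating a crossover from haplotype data can be sketched as a scan along ordered markers for the point where the parental origin changes. The example uses the markers reported for individual 9019. It assumes that the order in which the text lists the markers is their map order, and that D11S4139 lies telomeric to D11S4136. The function name is hypothetical.

```python
def crossover_intervals(calls):
    """Return (marker_before, marker_after) for each change of parental origin.

    `calls` is an ordered list of (marker, origin) pairs; origin is "HBM",
    "non-HBM", or None for an uninformative marker, which is skipped.
    """
    informative = [(marker, origin) for marker, origin in calls if origin is not None]
    return [
        (left[0], right[0])
        for left, right in zip(informative, informative[1:])
        if left[1] != right[1]
    ]


# Individual 9019; marker order taken from the text (assumed to be map order).
hbm_side = ["D11S935", "D11S1313", "GTC_HBM_Marker_4", "D11S987", "D11S1296",
            "GTC_HBM_Marker_6", "GTC_HBM_Marker_2", "D11S970", "GTC_HBM_Marker_3",
            "D11S4113", "GTC_HBM_Marker_1", "GTC_HBM_Marker_7", "GTC_HBM_Marker_5"]
other_side = ["D11S4136", "D11S4139"]
calls_9019 = [(m, "HBM") for m in hbm_side] + [(m, "non-HBM") for m in other_side]

intervals = crossover_intervals(calls_9019)
print(intervals)
```

Uninformative markers, such as D11S4113 in the father of individual 9020, are passed as `None`, so the reported interval widens to the nearest informative markers on either side.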

Two-point linkage analysis was also used to confirm the location of the Zmax1
(LRP5) gene on chromosome 11. The linkage results for two-point linkage analysis under a
model of full penetrance are presented in Table 5 below. This table lists the genetic markers
in the first column and the recombination fractions across the top of the table. Each cell
shows the LOD score for an individual marker tested for linkage to the Zmax1
(LRP5) gene at the recombination fraction shown in the first row. For example, the peak
LOD score of 7.66 occurs at marker D11S970, which is within the interval defined by
haplotype analysis.
TABLE 5

Marker      0.0          0.05         0.1          0.15         0.2          0.25         0.3          0.35         0.4
D11S935     -infinity    0.39         [illegible]  0.47         0.41         0.33         [illegible]  [illegible]  [illegible]
D11S1313    -infinity    2.64         [illegible]  2.80         [illegible]  2.30         [illegible]  [illegible]  [illegible]
D11S987     -infinity    5.49         [illegible]  4.70         4.13         [illegible]  [illegible]  2.03         [illegible]
D11S4113    4.33         [illegible]  3.62         3.24         [illegible]  2.40         [illegible]  1.40         [illegible]
D11S1337    [illegible]  [illegible]  [illegible]  [illegible]  [illegible]  [illegible]  [illegible]  [illegible]  [illegible]
D11S970     7.66         [illegible]  [illegible]  [illegible]  [illegible]  [illegible]  [illegible]  [illegible]  [illegible]
D11S4136    6.34         5.79         5.22         4.61         3.98         3.30         2.59         1.85         1.11
D11S4139    6.80         6.28         5.73         5.13         4.50         3.84         3.13         2.38         1.59
FGF3        0.59         3.23         3.15         2.91         2.61         2.25         1.84         1.40         [illegible]

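
For orientation, the relationship between a LOD score and the recombination fraction can be illustrated with the simplest textbook case: n phase-known, fully informative meioses of which k are recombinant. This is not the pedigree likelihood computed by MLINK, and the counts below are invented. It does show why a marker with at least one recombinant scores minus infinity at a recombination fraction of 0, as in the first column of Table 5.

```python
import math


def lod(theta, n, k):
    """Two-point LOD score for n phase-known meioses with k recombinants.

    LOD(theta) = log10( theta**k * (1 - theta)**(n - k) / 0.5**n )
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    if theta == 0.0:
        # Any observed recombinant is impossible under complete linkage.
        return float("-inf") if k > 0 else n * math.log10(2)
    log_likelihood = k * math.log10(theta) + (n - k) * math.log10(1 - theta)
    return log_likelihood - n * math.log10(0.5)


# Invented example: 20 informative meioses, 1 recombinant.
for step in range(9):
    theta = step * 0.05
    print(f"{theta:.2f}  {lod(theta, 20, 1):.2f}")
```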
A single nucleotide polymorphism (SNP) further defines the HBM region. This SNP
is termed SNP_Contig033-6 and is located 25 kb centromeric to the genetic marker
GTC_HBM_Marker_5. This SNP is telomeric to the genetic marker GTC_HBM_Marker_7.
SNP_Contig033-6 is present in HBM-affected individual 0113. However, the HBM-affected
individual 9019, who is the son of 0113, does not carry this SNP. Therefore, this indicates
that the crossover is centromeric to this SNP. The primer sequences for the genetic markers
GTC_HBM_Marker_5 and GTC_HBM_Marker_7 are shown in Table 6 below.



TABLE 6

Marker              Primer (Forward)                         Primer (Reverse)
GTC_HBM_Marker_5    TTTTGGGTACACAATTCAGTCG (SEQ ID NO:63)    AAAACTGTGGGTGCTTCTGG (SEQ ID NO:65)
GTC_HBM_Marker_7    GTGATTGAGCCAATCCTGAGA (SEQ ID NO:64)     TGAGCCAAATAAACCCCTTCT (SEQ ID NO:66)



The kindred described have several features of great interest, notably that their bones,
while very dense, have an absolutely normal shape. The outer dimensions of the skeletons of
the HBM-affected individuals are normal, and, while medullary cavities are present, there is
no interference with hematopoiesis. The HBM-affected members seem to be resistant to
fracture, and there are no neurologic symptoms, and no symptoms of impairment of any
organ or system function in the members examined. HBM-affected members of the kindred
live to advanced age without undue illness or disability. Furthermore, the HBM phenotype
matches no other bone disorders such as osteoporosis, osteoporosis pseudoglioma,
Engelmann's disease, Ribbing's disease, hyperphosphatasemia, Van Buchem's disease,
melorheostosis, osteopetrosis, pycnodysostosis, sclerostenosis, osteopoikilosis, acromegaly,
Paget's disease, fibrous dysplasia, tubular stenosis, osteogenesis imperfecta,
hypoparathyroidism, pseudohypoparathyroidism, pseudopseudohypoparathyroidism, primary
and secondary hyperparathyroidism and associated syndromes, hypercalciuria, medullary
carcinoma of the thyroid gland, osteomalacia and other diseases. Clearly, the HBM locus in
this family has a very powerful and substantial role in regulating bone density, and its
identification is an important step in understanding the pathway(s) that regulate bone density
and the pathogenesis of diseases such as osteoporosis.

In addition, older individuals carrying the HBM gene, and therefore expressing the
HBM protein, do not show the loss of bone mass characteristic of normal individuals. In other
words, the HBM gene is a suppressor of osteoporosis. In essence, individuals carrying the
HBM gene are dosed with the HBM protein, and, as a result, do not develop osteoporosis.
This in vivo observation is strong evidence that treatment of normal individuals with the
HBM gene or protein, or a fragment thereof, will ameliorate osteoporosis.

6. Physical Mapping

To provide reagents for the cloning and characterization of the HBM locus, the
genetic mapping data described above were used to construct a physical map of the region
containing Zmax1 (LRP5) on chromosome 11q13.3. The physical map consists of an ordered
set of molecular landmarks, and a set of BAC clones that contain the Zmax1 (LRP5) gene
region from chromosome 11q13.3.

Various publicly available mapping resources were utilized to identify existing STS
markers (Olson et al., 1989 Science 245:1434-5) in the HBM region. Resources included the
GDB, the Whitehead Institute Genome Center, dbSTS and dbEST (NCBI), 11db, the
University of Texas Southwestern GESTEC, the Stanford Human Genome Center, and
several literature references (Courseaux et al., 1997 Genomics 40: 13-23; Courseaux et al.,
1996 Genomics 37: 354-65; Guru et al., 1997 Genomics 42: 436-45; Hosoda et al., 1997
Genes Cells 2: 345-57; James et al., 1994 Nat. Genet. 8: 70-76; Kitamura et al., 1997 DNA
Research, 4: 281-9; Lemmens et al., 1997 Genomics 44: 94-100; and Smith et al., 1997
Genome Res. 7: 835-42). Maps were integrated manually to identify markers mapping to the
region containing Zmax1 (LRP5).

Primers for existing STSs were obtained from the GDB or literature references and are
listed in Table 7 below. Thus, Table 7 shows the STS markers used to prepare the physical
map of the Zmax1 (LRP5) gene region.

Novel STSs were developed either from publicly available genomic sequence or from
sequence-derived BAC insert ends. Primers were chosen using a script which automatically
performs vector and repetitive sequence masking using Cross_match (P. Green, Univ. of
Washington) and subsequent primer picking using Primer3 (Rozen, Skaletsky, 1996, 1997).
Primer3 is available at: www.genome.wi.mit.edu/genome_software/other/primer3.html.

Polymerase chain reaction (PCR) conditions for each primer pair were initially
optimized with respect to MgCl2 concentration. The standard buffer was 10 mM Tris-HCl
(pH 8.3), 50 mM KCl, MgCl2, 0.2 mM each dNTP, 0.2 µM each primer, 2.7 ng/µl human
DNA, 0.25 units of AmpliTaq (Perkin Elmer) and MgCl2 concentrations of 1.0 mM, 1.5 mM,
2.0 mM or 2.4 mM. Cycling conditions included an initial denaturation at 94°C for 2
minutes followed by 40 cycles at 94°C for 15 seconds, 55°C for 25 seconds, and 72°C for 25
seconds followed by a final extension at 72°C for 3 minutes. Depending on the results from
the initial round of optimization, the conditions were further optimized if necessary.
Variables included increasing the annealing temperature to 58°C or 60°C, increasing the
cycle number to 42 and the annealing and extension times to 30 seconds, and using
AmpliTaq™ Gold (Perkin Elmer).

BAC clones (Kim et al., 1996 Genomics 32: 213-8; and Shizuya et al., 1992 Proc.
Natl. Acad. Sci. USA 89: 8794-7) containing STS markers of interest were obtained by
PCR-based screening of DNA pools from a total human BAC library purchased from Research
Genetics. DNA pools derived from library plates 1-596 were used, corresponding to nine
genomic equivalents of human DNA. The initial screening process involved PCR reactions
of individual markers against superpools, i.e., a mixture of DNA derived from all BAC clones
from eight 384-well library plates. For each positive superpool, plate (8), row (16) and
column (24) pools were screened to identify a unique library address. PCR products were
electrophoresed in 2% agarose gels (Sigma) containing 0.5 µg/ml ethidium bromide in 1X
TBE at 150 volts for 45 min. The electrophoresis units used were the Model A3-1 systems
from Owl Scientific Products. Typically, gels contained 10 tiers of lanes with 50 wells/tier.
Molecular weight markers (100 bp ladder, Life Technologies, Bethesda, MD) were loaded at
both ends of the gel. Images of the gels were captured with a Kodak DC40 CCD camera and
processed with Kodak 1D software. The gel data were exported as tab delimited text files;
names of the files included information about the library screened, the gel image files and the
marker screened. These data were automatically imported using a customized Perl script into
Filemaker™ PRO (Claris Corp.) databases for data storage and analysis. In cases where
incomplete or ambiguous clone address information was obtained, additional experiments
were performed to recover a unique, complete library address.
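
The pooled screening scheme above resolves a positive superpool into a single well by combining one plate pool, one row pool and one column pool. The sketch below illustrates that deconvolution. It assumes that superpools are numbered from 1 and cover consecutive blocks of eight plates, and that all pool indices are 1-based; none of these conventions is stated in the text, and the function name is hypothetical.

```python
import string

ROWS = string.ascii_uppercase[:16]  # rows A-P of a 384-well plate (16 x 24)


def library_address(superpool, plate_hits, row_hits, column_hits, plates_per_superpool=8):
    """Resolve pool hits to (library plate, well).

    Returns None when any dimension has no hit or more than one hit, i.e. when
    the address is incomplete or ambiguous and further experiments are needed.
    """
    if not (len(plate_hits) == len(row_hits) == len(column_hits) == 1):
        return None
    plate = (superpool - 1) * plates_per_superpool + plate_hits[0]
    well = f"{ROWS[row_hits[0] - 1]}{column_hits[0]:02d}"
    return plate, well


print(library_address(3, [5], [2], [17]))
print(library_address(3, [5], [2, 9], [17]))  # two row pools positive: ambiguous
```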

Recovery of clonal BAC cultures from the library involved streaking out a sample
from the library well onto LB agar (Maniatis et al., Molecular Cloning: A Laboratory
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1982)) containing 12.5
µg/ml chloramphenicol (Sigma). Two individual colonies and a portion of the initial streak
quadrant were tested with appropriate STS markers by colony PCR for verification. Positive
clones were stored in LB broth containing 12.5 µg/ml chloramphenicol and 15% glycerol at
-70°C.

Several different types of DNA preparation methods were used for isolation of BAC
DNA. The manual alkaline lysis miniprep protocol listed below (Maniatis et al., 1982) was
successfully used for most applications, i.e., restriction mapping, CHEF gel analysis and FISH
mapping, but did not give reproducibly successful results in endsequencing. The Autogen and
Qiagen protocols were used specifically for BAC DNA preparation for endsequencing purposes.

Bacteria were grown in 15 ml Terrific Broth containing 12.5 µg/ml chloramphenicol
in a 50 ml conical tube at 37°C for 20 hrs with shaking at 300 rpm. The cultures were
centrifuged in a Sorvall RT 6000 D at 3000 rpm (~1800xg) at 4°C for 15 min. The
supernatant was then aspirated as completely as possible. In some cases, cell pellets were
frozen at -20°C at this step for up to 2 weeks. The pellet was then vortexed to homogenize
the cells and minimize clumping. 250 µl of P1 solution (50 mM glucose, 15 mM Tris-HCl,
pH 8, 10 mM EDTA, and 100 ng/ml RNase A) was added and the mixture pipetted up and
down to mix. The mixture was then transferred to a 2 ml Eppendorf tube. 350 µl of P2
solution (0.2 N NaOH, 1% SDS) was then added, the mixture mixed gently and incubated for
5 min at room temperature. 350 µl of P3 solution (3 M KOAc, pH 5.5) was added and the
mixture mixed gently until a white precipitate formed. The solution was incubated on ice for
5 min and then centrifuged at 4°C in a microfuge for 10 min. The supernatant was
transferred carefully (avoiding the white precipitate) to a fresh 2 ml Eppendorf tube, and 0.9
ml of isopropanol was added, the solution mixed and left on ice for 5 min. The samples were
centrifuged for 10 min, and the supernatant removed carefully. Pellets were washed in 70%
ethanol and air dried for 5 min. Pellets were resuspended in 200 µl of TE8 (10 mM Tris-HCl,
pH 8.0, 1.0 mM EDTA), and RNase A (Boehringer Mannheim) added to 100 µg/ml.
Samples were incubated at 37°C for 30 min, then precipitated by addition of C2H3O2Na·3H2O
to 0.5 M and 2 volumes of ethanol. Samples were centrifuged for 10 min, and the pellets
washed with 70% ethanol followed by air drying and dissolving in 50 µl TE8. Typical yields
for this DNA prep were 3-5 µg/15 ml bacterial culture. Ten to 15 µl were used for HindIII
restriction analysis; 5 µl was used for NotI digestion and clone insert sizing by CHEF gel
electrophoresis.
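
The protocol quotes centrifuge speeds in rpm with the equivalent relative centrifugal force in parentheses. The two are related by the standard formula RCF = 1.118 x 10^-5 x r x rpm^2, with the rotor radius r in cm. The text does not give the rotor radius, so the sketch below only back-calculates the radius implied by the stated pair (3000 rpm, ~1800xg). The function names are hypothetical.

```python
RCF_CONSTANT = 1.118e-5  # standard conversion constant for radius in cm


def rcf(rpm, radius_cm):
    """Relative centrifugal force (x g) for a speed in rpm and a rotor radius in cm."""
    return RCF_CONSTANT * radius_cm * rpm ** 2


def implied_radius_cm(rpm, g_force):
    """Rotor radius (cm) implied by a stated rpm / x g pair."""
    return g_force / (RCF_CONSTANT * rpm ** 2)


radius = implied_radius_cm(3000, 1800)
print(round(radius, 1))           # roughly 17.9 cm
print(round(rcf(3000, radius)))   # recovers 1800
```

When a different rotor is used, matching the quoted force rather than the rpm is the safer way to reproduce a protocol.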

BACs were inoculated into 15 ml of 2X LB Broth containing 12.5 µg/ml
chloramphenicol in a 50 ml conical tube. Four tubes were inoculated for each clone.
Cultures were grown overnight (~16 hr) at 37°C with vigorous shaking (>300 rpm).
Standard conditions for BAC DNA isolation were followed as recommended by the Autogen
740 manufacturer. 3 ml samples of culture were placed into Autogen tubes for a total of 60
ml or 20 tubes per clone. Samples were dissolved finally in 100 µl TE8 with 15 seconds of
shaking as part of the Autogen protocol. After the Autogen protocol was finished, DNA
solutions were transferred from each individual tube and pooled into a 2 ml Eppendorf tube.
Tubes with large amounts of debris (carry over from the pelleting debris step) were avoided.
The tubes were then rinsed with 0.5 ml of TE8 successively and this solution added to the
pooled material. DNA solutions were stored at 4°C; clumping tended to occur upon freezing
at -20°C. This DNA was either used directly for restriction mapping, CHEF gel analysis or
FISH mapping or was further purified as described below for use in endsequencing reactions.

The volume of DNA solutions was adjusted to 2 ml with TE8, samples were then
mixed gently and heated at 65°C for 10 min. The DNA solutions were then centrifuged at
4°C for 5 min and the supernatants transferred to a 15 ml conical tube. The NaCl
concentration was then adjusted to 0.75 M (~0.3 ml of 5 M NaCl to the 2 ml sample). The
total volume was then adjusted to 6 ml with Qiagen column equilibration buffer (Buffer
QBT). The supernatant containing the DNA was then applied to the column and allowed to
enter by gravity flow. Columns were washed twice with 10 ml of Qiagen Buffer QC. Bound
DNA was then eluted with four separate 1 ml aliquots of Buffer QF kept at 65°C. DNA was
precipitated with 0.7 volumes of isopropanol (~2.8 ml). Each sample was then transferred to
4 individual 2.2 ml Eppendorf tubes and incubated at room temperature for 2 hr or overnight.
Samples were centrifuged in a microfuge for 10 min. at 4°C. The supernatant was removed
carefully and 1 ml of 70% ethanol was added. Samples were centrifuged again and, because
the DNA pellets were often loose at this stage, the supernatant was removed carefully. Samples
were centrifuged again to concentrate remaining liquid, which was removed with a micropipet
tip. DNA pellets were then dried in a desiccator for 10 min. 20 µl of sterile distilled and
deionized H2O was added to each tube, which was then placed at 4°C overnight. The four 20
µl samples for each clone were pooled and the tubes rinsed with another 20 µl of sterile
distilled and deionized H2O for a final volume of 100 µl. Samples were then heated at 65°C
for 5 min. and then mixed gently. Typical yields were 2-5 µg/60 ml culture as assessed by
NotI digestion and comparison with uncut lambda DNA.
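
The two approximate volumes quoted in this purification (about 0.3 ml of 5 M NaCl, and about 2.8 ml of isopropanol) follow from simple mixing arithmetic, sketched below. The NaCl calculation assumes the 2 ml sample contains negligible NaCl before the adjustment, which the text does not state. The function name is hypothetical.

```python
def stock_volume_to_add(sample_ml, target_molar, stock_molar, start_molar=0.0):
    """Volume of stock (ml) that brings `sample_ml` to `target_molar`.

    Solves (sample*start + x*stock) / (sample + x) = target for x.
    """
    if not start_molar < target_molar < stock_molar:
        raise ValueError("target must lie between the starting and stock concentrations")
    return sample_ml * (target_molar - start_molar) / (stock_molar - target_molar)


nacl_ml = stock_volume_to_add(sample_ml=2.0, target_molar=0.75, stock_molar=5.0)
print(round(nacl_ml, 2))  # about 0.35 ml, in line with the "~0.3 ml" quoted

eluate_ml = 4 * 1.0                 # four 1 ml aliquots of Buffer QF
isopropanol_ml = 0.7 * eluate_ml    # 0.7 volumes
print(round(isopropanol_ml, 1))
```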

3 ml of LB Broth containing 12.5 µg/ml of chloramphenicol was dispensed into
autoclaved Autogen tubes. A single tube was used for each clone. For inoculation, glycerol
stocks were removed from -70°C storage and placed on dry ice. A small portion of the
glycerol stock was removed from the original tube with a sterile toothpick and transferred
into the Autogen tube; the toothpick was left in the Autogen tube for at least two minutes
before discarding. After inoculation the tubes were covered with tape, making sure the seal
was tight. When all samples were inoculated, the tube units were transferred into an Autogen
rack holder and placed into a rotary shaker at 37°C for 16-17 hours at 250 rpm. Following
growth, standard conditions for BAC DNA preparation, as defined by the manufacturer, were
used to program the Autogen. Samples were not dissolved in TE8 as part of the program and
DNA pellets were left dry. When the program was complete, the tubes were removed from
the output tray and 30 µl of sterile distilled and deionized H2O was added directly to the
bottom of the tube. The tubes were then gently shaken for 2-5 seconds and then covered with
parafilm and incubated at room temperature for 1-3 hours. DNA samples were then
transferred to an Eppendorf tube and used either directly for sequencing or stored at 4°C for
later use.

7. BAC Clone Characterization for Physical Mapping

DNA samples prepared either by manual alkaline lysis or the Autogen protocol were
digested with HindIII for analysis of restriction fragment sizes. These data were used to
compare the extent of overlap among clones. Typically 1-2 µg were used for each reaction.
Reaction mixtures included: 1X Buffer 2 (New England Biolabs), 0.1 mg/ml bovine serum
albumin (New England Biolabs), 50 ng/ml RNase A (Boehringer Mannheim), and 20 units of
HindIII (New England Biolabs) in a final volume of 25 µl. Digestions were incubated at
37°C for 4-6 hours. BAC DNA was also digested with NotI for estimation of insert size by
CHEF gel analysis (see below). Reaction conditions were identical to those for HindIII
except that 20 units of NotI were used. Six µl of 6X Ficoll loading buffer containing
bromphenol blue and xylene cyanol was added prior to electrophoresis.

HindIII digests were analyzed on 0.6% agarose (Seakem, FMC Bioproducts) in 1X
TBE containing 0.5 µg/ml ethidium bromide. Gels (20 cm X 25 cm) were electrophoresed in
a Model A4 electrophoresis unit (Owl Scientific) at 50 volts for 20-24 hrs. Molecular weight
size markers included undigested lambda DNA, HindIII digested lambda DNA, and HaeIII
digested φX174 DNA. Molecular weight markers were heated at 65°C for 2 min prior to
loading the gel. Images were captured with a Kodak DC40 CCD camera and analyzed with
Kodak 1D software.

NotI digests were analyzed on a CHEF DR II (BioRad) electrophoresis unit according
to the manufacturer's recommendations. Briefly, 1% agarose gels (BioRad pulsed field
grade) were prepared in 0.5X TBE, equilibrated for 30 minutes in the electrophoresis unit at
14°C, and electrophoresed at 6 volts/cm for 14 hrs with circulation. Switching times were
ramped from 10 sec to 20 sec. Gels were stained after electrophoresis in 0.5 µg/ml ethidium
bromide. Molecular weight markers included undigested lambda DNA, HindIII digested
lambda DNA, lambda ladder PFG ladder, and low range PFG marker (all from New England
Biolabs).

BAC DNA prepared either by the manual alkaline lysis or Autogen protocols was
labeled for FISH analysis using a Bioprime labeling kit (BioRad) according to the
manufacturer's recommendation with minor modifications. Approximately 200 ng of DNA
was used for each 50 µl reaction. 3 µl were analyzed on a 2% agarose gel to determine the
extent of labeling. Reactions were purified using a Sephadex G50 spin column prior to in
situ hybridization. Metaphase FISH was performed as described (Ma et al., 1996 Cytogenet.
Cell Genet., 74: 266-71).

8. BAC Endsequencing

The sequencing of BAC insert ends utilized DNA prepared by either of the two
methods described above. The DYEnamic energy transfer primers and DYEnamic Direct cycle
sequencing kits from Amersham were used for sequencing reactions. Ready made
sequencing mix including the M13 -40 forward sequencing primer was used (Catalog #
US79730) for the T7 BAC vector terminus; ready made sequencing mix (Catalog #
US79530) was mixed with the M13 -28 reverse sequencing primer (Catalog # US79339) for
the SP6 BAC vector terminus. The sequencing reaction mixes included one of the four
fluorescently labeled dye-primers, one of the four dideoxy termination mixes, dNTPs,
reaction buffer, and Thermosequenase. For each BAC DNA sample, 3 µl of the BAC DNA
sample was aliquoted to 4 PCR strip tubes. 2 µl of one of the four dye primer/termination
mix combinations was then added to each of the four tubes. The tubes were then sealed and
centrifuged briefly prior to PCR. Thermocycling conditions involved a 1 minute denaturation
at 95°C, 15 second annealing at 45°C, and extension for 1 minute at 70°C for 35 total cycles.
After cycling the plates were centrifuged briefly to collect all the liquid to the bottom of the
tubes. 5 µl of sterile distilled and deionized H2O was then added into each tube, the plates
sealed and centrifuged briefly again. The four samples for each BAC were then pooled
together. DNA was then precipitated by adding 1.5 µl of 7.5 M NH4OAc and 100 µl of
-20°C 100% ethanol to each tube. Samples were mixed by pipetting up and down once. The
plates were then sealed and incubated on ice for 10 minutes. Plates were centrifuged in a
tabletop Heraeus centrifuge at 4000 rpm (3,290xg) for 30 minutes at 4°C to recover the
DNA. The supernatant was removed and excess liquid blotted onto paper towels. Pellets
were washed by adding 100 µl of -20°C 70% ethanol into each tube and re-centrifuging at
4000 rpm (3,290xg) for 10 minutes at 4°C. The supernatant was removed and excess liquid
again removed by blotting on a paper towel. Remaining traces of liquid were removed by
placing the plates upside down over a paper towel and centrifuging only until the centrifuge

reached 800 rpm. Samples were then air dried at room temperature for 30 min. Tubes were
capped and stored dry at -20°C until electrophoresis. Immediately prior to electrophoresis
the DNA was dissolved in 1.5 µl of Amersham loading dye. Plates were then sealed and
centrifuged at 2000 rpm (825xg). The plates were then vortexed on a plate shaker for 1-2
minutes. Samples were then recentrifuged at 2000 rpm (825xg) briefly. Samples were then
heated at 65°C for 2 min and immediately placed on ice. Standard gel electrophoresis was
performed on ABI 377 fluorescent sequencers according to the manufacturer's
recommendation.
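
The rotor speeds and g-forces quoted in this protocol are related by the standard relative centrifugal force formula, RCF = 1.118 x 10^-5 x r x rpm^2, with r in cm. The following is a minimal sketch of that conversion. The rotor radius of 18.4 cm is an assumption back-calculated from the quoted 4000 rpm (3,290xg) figure; it is not a value stated in the protocol.

```python
# Relative centrifugal force (RCF, in multiples of g) from rotor speed.
# Standard relation: RCF = 1.118e-5 * radius_cm * rpm**2.

ASSUMED_RADIUS_CM = 18.4  # hypothetical; inferred from 4000 rpm ~ 3,290 x g


def rcf_from_rpm(rpm: float, radius_cm: float = ASSUMED_RADIUS_CM) -> float:
    """Return the relative centrifugal force for a given speed and radius."""
    return 1.118e-5 * radius_cm * rpm ** 2


def rpm_from_rcf(rcf: float, radius_cm: float = ASSUMED_RADIUS_CM) -> float:
    """Return the rotor speed needed to reach a target RCF."""
    return (rcf / (1.118e-5 * radius_cm)) ** 0.5


if __name__ == "__main__":
    for speed in (800, 2000, 4000):
        print(f"{speed} rpm ~ {rcf_from_rpm(speed):.0f} x g")
```

With the assumed radius, both quoted pairs (4000 rpm and 2000 rpm) are reproduced to within a few g, which suggests they refer to the same rotor.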

9. Sub-cloning and Sequencing of HBM BAC DNA

The physical map of the Zmax1 (LRP5) gene region provides a set of BAC clones that
contain within them the Zmax1 (LRP5) gene and the HBM gene. DNA sequencing of several
of the BACs from the region has been completed. The DNA sequence data is a unique
reagent that includes data that one skilled in the art can use to identify the Zmax1 (LRP5)
gene and the HBM gene, or to prepare probes to identify the gene(s), or to identify DNA
sequence polymorphisms that identify the gene(s).

BAC DNA was isolated according to one of two protocols, either a Qiagen
purification of BAC DNA (Qiagen, Inc. as described in the product literature) or a manual
purification which is a modification of the standard alkaline lysis/Cesium Chloride
preparation of plasmid DNA (see e.g., Ausubel et al., Current Protocols in Molecular
Biology, John Wiley & Sons (1997)). Briefly for the manual protocol, cells were pelleted,
resuspended in GTE (50 mM glucose, 25 mM Tris-Cl (pH 8), 10 mM EDTA) and lysozyme
(50 mg/ml solution), followed by NaOH/SDS (1% SDS/0.2 N NaOH) and then an ice-cold
solution of 3 M KOAc (pH 4.5-4.8). RNaseA was added to the filtered supernatant, followed
by Proteinase K and 20% SDS. The DNA was then precipitated with isopropanol, dried and
resuspended in TE (10 mM Tris, 1 mM EDTA (pH 8.0)). The BAC DNA was further
purified by Cesium Chloride density gradient centrifugation (Ausubel et al., 1997).

Following isolation, the BAC DNA was sheared hydrodynamically using an HPLC
(Hengen, 1997 Trends in Biochem. Sci. 22: 273-4) to an insert size of 2000-3000 bp. After
shearing, the DNA was concentrated and separated on a standard 1% agarose gel. A single
fraction, corresponding to the approximate size, was excised from the gel and purified by
electroelution (Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Laboratory, Cold Spring Harbor, NY (1989)).

The purified DNA fragments were then blunt-ended using T4 DNA polymerase. The
blunt-ended DNA was then ligated to unique BstXI-linker adapters (5'-GTCTTCACCACGGGG-3'
and 5'-GTGGTGAAGAC-3' in 100-1000 fold molar excess; SEQ ID NOS: 627 and
628 respectively). These linkers were complementary to the BstXI-cut pMPX vectors
(constructed by the inventors), while the overhang was not self-complementary. Therefore,
the linkers would not concatemerize nor would the cut-vector religate itself easily. The
linker-adapted inserts were separated from the unincorporated linkers on a 1% agarose gel
and purified using GeneClean (BIO 101, Inc.). The linker-adapted insert was then ligated to
a modified pBlueScript vector to construct a "shotgun" subclone library. The vector
contained an out-of-frame lacZ gene at the cloning site which became in-frame in the event
that an adapter-dimer was cloned, allowing these to be avoided by their blue color.
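
The statement that the adapter overhang is not self-complementary can be checked directly from the two oligonucleotide sequences. The sketch below is illustrative only: it assumes the shorter oligo anneals to the 5' portion of the longer one, and it treats whatever remains unpaired as the overhang. The variable names are hypothetical.

```python
# Check the two adapter oligos quoted in the text (SEQ ID NOS: 627 and 628).

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


LONG_OLIGO = "GTCTTCACCACGGGG"   # 5'->3'
SHORT_OLIGO = "GTGGTGAAGAC"      # 5'->3'

# The short oligo pairs with the 5' portion of the long oligo.
duplex = revcomp(SHORT_OLIGO)
pairs_cleanly = LONG_OLIGO.startswith(duplex)

# Whatever is left unpaired on the long oligo is the single-stranded overhang.
overhang = LONG_OLIGO[len(duplex):]

# A self-complementary overhang equals its own reverse complement and could
# anneal to another copy of itself, letting adapters concatemerize.
is_self_complementary = overhang == revcomp(overhang)

if __name__ == "__main__":
    print("duplex region:", duplex)
    print("overhang:", overhang)
    print("self-complementary:", is_self_complementary)
```

Because the overhang differs from its own reverse complement, two adapter ends cannot base-pair with each other, which is consistent with the text's observation that the linkers do not concatemerize.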

All subsequent steps were based on sequencing by ABI377 automated DNA
sequencing methods. Only major modifications to the protocols are highlighted. Briefly, the
library was then transformed into DH5α competent cells (Life Technologies, Bethesda, MD,
DH5α transformation protocol). It was assessed by plating onto antibiotic plates containing
ampicillin and IPTG/Xgal. The plates were incubated overnight at 37°C. Successful
transformants were then used for plating of clones and picking for sequencing. The cultures
were grown overnight at 37°C. DNA was purified using a silica bead DNA preparation (Ng et
al., 1996 Nucl. Acids Res. 24: 5045-7) method. In this manner, 25 µg of DNA was obtained
per clone.

These purified DNA samples were then sequenced using ABI dye-terminator
chemistry. The ABI dye terminator sequence reads were run on ABI377 machines and the
data was directly transferred to UNIX machines following lane tracking of the gels. All reads
were assembled using PHRAP (P. Green, Abstracts of DOE Human Genome Program
Contractor-Grantee Workshop V, Jan. 1996, p. 157) with default parameters and quality
scores. The initial assembly was done at 6-fold coverage and yielded an average of 8-15
contigs. Following the initial assembly, missing mates (sequences from clones that only
gave one strand reads) were identified and sequenced with ABI technology to allow the
identification of additional overlapping contigs. Primers for walking were selected using a
Genome Therapeutics program Pick_primer near the ends of the clones to facilitate gap
closure. These walks were sequenced using the selected clones and primers. Data were
reassembled with PHRAP into sequence contigs.
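
To give a feel for what "6-fold coverage" implies, the sketch below computes the number of reads required and an idealized Lander-Waterman estimate of the number of contigs. The BAC insert size (150 kb) and read length (500 bp) are assumptions chosen for illustration; neither is stated in the text, and the estimate is not part of the described protocol.

```python
import math


def reads_for_coverage(target_bp: int, read_bp: int, coverage: float) -> int:
    """Number of reads needed so that reads * read_bp / target_bp >= coverage."""
    return math.ceil(coverage * target_bp / read_bp)


def expected_contigs(n_reads: int, coverage: float) -> float:
    """Idealized Lander-Waterman estimate: n_reads * exp(-coverage).

    Ignores the minimum overlap needed to join reads, repeats and cloning
    bias, so real assemblies usually yield more contigs than this.
    """
    return n_reads * math.exp(-coverage)


if __name__ == "__main__":
    bac_bp = 150_000   # assumed BAC insert size (not stated in the text)
    read_bp = 500      # assumed usable read length (not stated in the text)
    n = reads_for_coverage(bac_bp, read_bp, 6)
    print(n, "reads for 6-fold coverage")
    print(f"{expected_contigs(n, 6):.1f} contigs expected in the ideal case")
```

The idealized figure is lower than the 8-15 contigs reported, as expected for a model that ignores repeats and overlap requirements; the remaining gaps are what the missing-mate sequencing and primer walking address.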

10. Gene Identification by Computational Methods

Following assembly of the BAC sequences into contigs, the contigs were subjected to 
computational analyses to identify coding regions and regions bearing DNA sequence 
similarity to known genes. This protocol included the following steps. 

1. Degap the contigs: the sequence contigs often contain symbols
(denoted by a period symbol) that represent locations where the individual ABI 
sequence reads have insertions or deletions. Prior to automated computational 
analysis of the contigs, the periods were removed. The original data was maintained 
for future reference. 

2. BAC vector sequences were "masked" within the sequence by using
the program cross_match (Phil Green, http://chimera.biotech.washington.edu/UWGC).
Since the shotgun library construction detailed above leaves some BAC vector in the
shotgun libraries, this program was used to compare the sequence of the BAC contigs
to the BAC vector and to mask any vector sequence prior to subsequent steps.
Masked sequences were marked by an "X" in the sequence files, and remained inert
during subsequent analyses.

3. E. coli sequences contaminating the BAC sequences were masked by 
comparing the BAC contigs to the entire E. coli DNA sequence. 

4. Repetitive elements known to be common in the human genome were
masked using cross_match. In this implementation of cross_match, the BAC sequence
was compared to a database of human repetitive elements (Jerzy Jurka, Genetic
Information Research Institute, Palo Alto, CA). The masked repeats were marked by
X and remained inert during subsequent analyses.

5. The location of exons within the sequence was predicted using the
MZEF computer program (Zhang, 1997 Proc. Natl. Acad. Sci. USA 94: 565-8).

6. The sequence was compared to the publicly available UniGene database
(National Center for Biotechnology Information, National Library of Medicine, 38A,
8N905, 8600 Rockville Pike, Bethesda, MD 20894; www.ncbi.nlm.nih.gov) using
the blastn2 algorithm (Altschul et al., 1997 Nucl. Acids Res. 25: 3389-3402). The
parameters for this search were: E=0.05, V=50, B=50 (where E is the expected
probability score cutoff, V is the number of database entries returned in the reporting
of the results, and B is the number of sequence alignments returned in the reporting of
the results (Altschul et al., J. Mol. Biol. 215: 403-410 (1990))).

7. The sequence was translated into protein for all six reading frames, and
the protein sequences were compared to a non-redundant protein database compiled
from Genpept, Swissprot and PIR (National Center for Biotechnology Information,
National Library of Medicine, 38A, 8N905, 8600 Rockville Pike, Bethesda, MD
20894; www.ncbi.nlm.nih.gov). The parameters for this search were E=0.05, V=50,
B=50, where E, V, and B are defined as above.

8. The BAC DNA sequence was compared to the database of the cDNA
clones derived from direct selection experiments (described below) using blastn2
(Altschul et al., 1997). The parameters for this search were E=0.05, V=250, B=250,
where E, V, and B are defined as above.

9. The BAC sequence was compared to the sequences of all other BACs
from the HBM region on chromosome 11q12-13 using blastn2 (Altschul et al., 1997).
The parameters for this search were E=0.05, V=50, B=50, where E, V, and B are
defined as above.

10. The BAC sequence was compared to the sequences derived from the
ends of BACs from the HBM region on chromosome 11q12-13 using blastn2
(Altschul et al., 1997). The parameters for this search were E=0.05, V=50, B=50,
where E, V, and B are defined as above.

11. The BAC sequence was compared to the GenBank database (National
Center for Biotechnology Information, National Library of Medicine, 38A, 8N905,
8600 Rockville Pike, Bethesda, MD 20894; www.ncbi.nlm.nih.gov) using blastn2
(Altschul et al., 1997). The parameters for this search were E=0.05, V=50, B=50,
where E, V, and B are defined as above.

12. The BAC sequence was compared to the STS division of GenBank
database (National Center for Biotechnology Information, National Library of
Medicine, 38A, 8N905, 8600 Rockville Pike, Bethesda, MD 20894;
www.ncbi.nlm.nih.gov) using blastn2 (Altschul et al., 1997). The parameters for this
search were E=0.05, V=50, B=50, where E, V, and B are defined as above.

13. The BAC sequence was compared to the Expressed Sequence Tag (EST)
GenBank database (National Center for Biotechnology Information, National
Library of Medicine, 38A, 8N905, 8600 Rockville Pike, Bethesda, MD 20894;
www.ncbi.nlm.nih.gov) using blastn2 (Altschul et al., 1997). The parameters for this
search were E=0.05, V=250, B=250, where E, V, and B are defined as above.
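
The pre-processing steps in the list above (degapping, masking with "X", and six-frame translation) can be sketched in a few lines. This is a simplified illustration, not the pipeline actually used: cross_match performs alignment-based masking, whereas the `mask` helper below only overwrites exact matches, and all function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGTX", "TGCAX")

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def degap(contig: str) -> str:
    """Step 1: drop the period symbols that mark insertions/deletions."""
    return contig.replace(".", "")


def mask(contig: str, contaminant: str) -> str:
    """Steps 2-4 (simplified): overwrite exact matches with 'X'.

    cross_match itself performs alignment; exact matching is used here only
    to keep the sketch short.
    """
    return contig.replace(contaminant, "X" * len(contaminant))


def revcomp(seq: str) -> str:
    """Reverse complement; masked 'X' positions stay masked."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq: str) -> str:
    """Translate one frame; codons touching masked bases become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame(seq: str) -> list[str]:
    """Step 7: three forward frames followed by three reverse frames."""
    strands = (seq, revcomp(seq))
    return [translate(s[offset:]) for s in strands for offset in range(3)]
```

Keeping masked positions as "X" through translation mirrors the text's point that masked regions "remained inert" in later analyses: they occupy their original coordinates but cannot produce a database match.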

11. Gene Identification by Direct cDNA Selection

Primary linkered cDNA pools were prepared from bone marrow, calvarial bone,
femoral bone, kidney, skeletal muscle, testis and total brain. Poly(A)+ RNA was prepared
from calvarial and femoral bone tissue (Chomczynski et al., 1987 Anal. Biochem. 162: 156-9;
and D'Alessio et al., 1987 Focus 9: 1-4) and the remainder of the mRNA was purchased from
Clontech (Palo Alto, California). In order to generate oligo(dT) and random primed cDNA
pools from the same tissue, 2.5 µg mRNA was mixed with oligo(dT) primer in one reaction
and 2.5 µg mRNA was mixed with random hexamers in another reaction, and both were
converted to first and second strand cDNA according to the manufacturer's recommendations
(Life Technologies, Bethesda, MD). Paired phosphorylated cDNA linkers (see sequences
below) were annealed together by mixing in a 1:1 ratio (10 µg each), incubated at 65°C for
five minutes and allowed to cool to room temperature.
Paired linkers oligo 1/2

OLIGO 1: 5'-CTG AGC GGA ATT CGT GAG ACC-3' (SEQ ID NO:12)

OLIGO 2: 5'-TTG GTC TCA CGT ATT CCG CTC GA-3' (SEQ ID NO:13)

Paired linkers oligo 3/4

OLIGO 3: 5'-CTC GAG AAT TCT GGA TCC TC-3' (SEQ ID NO:14)

OLIGO 4: 5'-TTG AGG ATC CAG AAT TCT CGA G-3' (SEQ ID NO:15)

Paired linkers oligo 5/6

OLIGO 5: 5'-TGT ATG CGA ATT CGC TGC GCG-3' (SEQ ID NO:16)

OLIGO 6: 5'-TTC GCG CAG CGA ATT CGC ATA CA-3' (SEQ ID NO:17)

Paired linkers oligo 7/8

OLIGO 7: 5'-GTC CAC TGA ATT CTC AGT GAG-3' (SEQ ID NO:18)

OLIGO 8: 5'-TTG TCA CTG AGA ATT CAG TGG AC-3' (SEQ ID NO:19)

Paired linkers oligo 11/12

OLIGO 11: 5'-GAA TCC GAA TTC CTG GTC AGC-3' (SEQ ID NO:20)

OLIGO 12: 5'-TTG CTG ACC AGG AAT TCG GAT TC-3' (SEQ ID NO:21)

Linkers were ligated to all oligo(dT) and random primed cDNA pools (see below)
according to the manufacturer's instructions (Life Technologies, Bethesda, MD).

Oligo 1/2 was ligated to oligo(dT) and random primed cDNA pools prepared from
bone marrow. Oligo 3/4 was ligated to oligo(dT) and random primed cDNA pools prepared
from calvarial bone. Oligo 5/6 was ligated to oligo(dT) and random primed cDNA pools
prepared from brain and skeletal muscle. Oligo 7/8 was ligated to oligo(dT) and random
primed cDNA pools prepared from kidney. Oligo 11/12 was ligated to oligo(dT) and random
primed cDNA pools prepared from femoral bone.
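
Because each tissue pool carries a distinct linker pair, the linker at the start of a cloned sequence indicates which tissue pool(s) the cDNA could have come from. The lookup below is a hypothetical illustration of that idea, built only from the oligo sequences and tissue assignments listed above; the text does not describe such a program, and the function name is an assumption.

```python
# Sense-strand linker oligos and the tissues they were ligated to, as listed
# in the text. The lookup function itself is a hypothetical illustration.
LINKERS = {
    "1/2": ("CTGAGCGGAATTCGTGAGACC", ["bone marrow"]),
    "3/4": ("CTCGAGAATTCTGGATCCTC", ["calvarial bone"]),
    "5/6": ("TGTATGCGAATTCGCTGCGCG", ["brain", "skeletal muscle"]),
    "7/8": ("GTCCACTGAATTCTCAGTGAG", ["kidney"]),
    "11/12": ("GAATCCGAATTCCTGGTCAGC", ["femoral bone"]),
}


def tissues_for_read(read: str):
    """Return (linker name, candidate tissues) if the read starts with a linker."""
    read = read.upper().replace(" ", "")
    for name, (oligo, tissues) in LINKERS.items():
        if read.startswith(oligo):
            return name, tissues
    return None


if __name__ == "__main__":
    print(tissues_for_read("CTC GAG AAT TCT GGA TCC TC" + "ACGTACGT"))
```

Note that linker 5/6 is shared by brain and skeletal muscle, so a read carrying it narrows the origin to two tissues rather than one.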

The cDNA pools were evaluated for length distribution by PCR amplification using 1
µl of a 1:1, 1:10, and 1:100 dilution of the ligation reaction, respectively. PCR reactions
were performed in a Perkin Elmer 9600. Each 25 µl volume reaction contained 1 µl of DNA,
10 mM Tris-HCl (pH 8.3), 50 mM KCl, 1.5 mM MgCl2, 0.001% gelatin, 200 mM each
dNTP, 10 µM primer and 1 unit Taq DNA polymerase (Perkin Elmer) and was amplified
under the following conditions: 30 seconds at 94°C, 30 seconds at 60°C and 2 minutes at
72°C for 30 cycles. The length distribution of the amplified cDNA pools was evaluated by
electrophoresis on a 1% agarose gel. The PCR reaction that gave the best representation of
the random primed and oligo(dT) primed cDNA pools was scaled up so that ~2-3 µg of each
cDNA pool was produced. The starting cDNA for the direct selection reaction comprised
0.5 µg of random primed cDNAs mixed with 0.5 µg of oligo(dT) primed cDNAs.

The DNA from the 54 BACs that were used in the direct cDNA selection procedure
was isolated using Nucleobond AX columns as described by the manufacturer (The Nest
Group, Inc.).

The BACs were pooled in equimolar amounts and 1 µg of the isolated genomic DNA
was labeled with biotin 16-UTP by nick translation in accordance with the manufacturer's
instructions (Boehringer Mannheim). The incorporation of the biotin was monitored by
methods that could be practiced by one skilled in the art (Del Mastro et al., Methods in
Molecular Biology, Humana Press Inc., NJ (1996)).

Direct cDNA selection was performed using methods that could be practiced by one
skilled in the art (Del Mastro et al., 1996). Briefly, the cDNA pools were multiplexed in two
separate reactions: in one reaction cDNA pools from bone marrow, calvarial bone, brain and
testis were mixed, and in the other cDNA pools from skeletal muscle, kidney and femoral
bone were mixed. Suppression of the repeats, yeast sequences and plasmid in the cDNA
pools was performed to a Cot of 20. 100 ng of biotinylated BAC DNA was mixed with the
suppressed cDNAs and hybridized in solution to a Cot of 200. The biotinylated DNA and the
cognate cDNAs were captured on streptavidin-coated paramagnetic beads. The beads were
washed and the primary selected cDNAs were eluted. These cDNAs were PCR amplified
and a second round of direct selection was performed. The product of the second round of
direct selection is referred to as the secondary selected material. A Galanin cDNA clone,
previously shown to map to 11q12-13 (Evans, 1993 Genomics 18: 473-7), was used to
monitor enrichment during the two rounds of selection.
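
A Cot value is the product of nucleotide concentration (moles of nucleotide per liter) and hybridization time (seconds), so the time needed to reach a target Cot follows from the DNA mass and reaction volume. The sketch below illustrates that arithmetic under stated assumptions: an average nucleotide mass of 330 g/mol, no correction for salt or temperature, and example inputs (1 µg in 10 µl) that are hypothetical rather than taken from the protocol.

```python
NUCLEOTIDE_G_PER_MOL = 330.0  # assumed average mass of one nucleotide residue


def nucleotide_molarity(dna_ug: float, volume_ul: float) -> float:
    """Moles of nucleotide per liter for a given DNA mass and volume."""
    moles = (dna_ug * 1e-6) / NUCLEOTIDE_G_PER_MOL
    return moles / (volume_ul * 1e-6)


def seconds_to_reach_cot(cot: float, dna_ug: float, volume_ul: float) -> float:
    """Time needed for C0 * t to reach the requested Cot value."""
    return cot / nucleotide_molarity(dna_ug, volume_ul)


if __name__ == "__main__":
    # Hypothetical example: 1 µg of DNA in a 10 µl hybridization.
    for target in (20, 200):
        hours = seconds_to_reach_cot(target, 1.0, 10.0) / 3600
        print(f"Cot {target}: about {hours:.0f} h")
```

The ten-fold difference between the suppression step (Cot 20) and the selection step (Cot 200) translates directly into a ten-fold longer incubation at the same concentration, or a ten-fold higher concentration for the same time.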

The secondary selected material from bone marrow, calvarial bone, femoral bone,
kidney, skeletal muscle, testis and total brain was PCR amplified using modified primers of
oligos 1, 3, 5, 7 and 11, shown below, and cloned into the UDG vector pAMP10 (Life
Technologies, Bethesda, MD), in accordance with the manufacturer's recommendations.

Modified primer sequences:

Primer     SEQ ID NO.   Sequence

Oligo 1    22           5'-CUA CUA CUA CUA CTG AGC GGA ATT CGT GAG ACC-3'

Oligo 3    23           5'-CUA CUA CUA CUA CTC GAG AAT TCT GGA TCC TC-3'

Oligo 5    24           5'-CUA CUA CUA CUA TGT ATG CGA ATT CGC TGC GCG-3'

Oligo 7    25           5'-CUA CUA CUA CUA GTC CAC TGA ATT CTC AGT GAG-3'

Oligo 11   26           5'-CUA CUA CUA CUA GAA TCC GAA TTC CTG GTC AGC-3'

The cloned secondary selected material, from each tissue source, was transformed into
MAX Efficiency DH5α Competent Cells (Life Technologies, Bethesda, MD) as
recommended by the manufacturer. 384 colonies were picked from each transformed source
and arrayed into four 96 well microtiter plates.
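
Arraying 384 picked colonies into four 96-well plates amounts to a simple index-to-address mapping. The sketch below assumes a row-major fill order (A1, A2, ... A12, B1, ...) and 1-based plate numbers; the text does not specify the fill order, so both are illustrative choices.

```python
ROWS = "ABCDEFGH"
COLS = 12
WELLS_PER_PLATE = len(ROWS) * COLS  # 96


def plate_position(colony_index: int):
    """Map a 0-based colony index (0..383) to (plate number, well label)."""
    if not 0 <= colony_index < 4 * WELLS_PER_PLATE:
        raise ValueError("expected an index between 0 and 383")
    plate, offset = divmod(colony_index, WELLS_PER_PLATE)
    row, col = divmod(offset, COLS)
    return plate + 1, f"{ROWS[row]}{col + 1}"


if __name__ == "__main__":
    for i in (0, 95, 96, 383):
        print(i, plate_position(i))
```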

All secondarily selected cDNA clones were sequenced using the M13 dye primer
terminator cycle sequencing kit (Applied Biosystems), and the data were collected by the ABI 377
automated fluorescence sequencer (Applied Biosystems).

All sequences were analyzed using the BLASTN, BLASTX and FASTA programs
(Altschul et al., 1990 J. Mol. Biol. 215: 403-410; and Altschul et al., 1997 Nucl. Acids Res. 25:
3389-3402). The cDNA sequences were compared to a database containing sequences
derived from human repeats, mitochondrial DNA, ribosomal RNA and E. coli DNA to remove
background clones from the dataset using the program cross_match. A further round of
comparison was also performed using the program BLASTN2 against known genes
(GenBank) and the BAC sequences from the HBM region. Those cDNAs that were >90%
homologous to these sequences were filed according to the result and the data stored in a
database for further analysis. cDNA sequences that were identified but did not have
significant similarity to the BAC sequences from the HBM region or were eliminated by
cross_match were hybridized to nylon membranes which contained the BACs from the HBM
region, to ascertain whether they hybridized to the target.

Hybridization analysis was used to map the cDNA clones to the BAC target that
selected them. The BACs that were identified from the HBM region were arrayed and grown
in a 96 well microtiter plate. LB agar containing 25 µg/ml kanamycin was poured into 96
well microtiter plate lids. Once the agar had solidified, pre-cut Hybond N+ nylon membranes
(Amersham) were laid on top of the agar and the BACs were stamped onto the membranes in
duplicate using a hand held 96 well replica plater (V&P Scientific, Inc.). The plates were
incubated overnight at 37°C. The membranes were processed according to the manufacturer's
recommendations.

The cDNAs that needed to be mapped by hybridization were PCR amplified using the
relevant primer (oligos 1, 3, 5, 7 and 11) that would amplify that clone. For this PCR
amplification, the primers were modified to contain a linkered digoxigenin molecule at the 5'
end of the oligonucleotide. The PCR amplification was performed under the same conditions as
described in Preparation of cDNA Pools (above). The PCR products were evaluated for
quality and quantity by electrophoresis on a 1% agarose gel by loading 5 µl of the PCR
reaction. The nylon membranes containing the stamped BACs were individually
pre-hybridized in 50 ml conical tubes containing 10 ml of hybridization solution (5X SSPE, 0.5X
Blotto, 2.5% SDS and 1 mM EDTA (pH 8.0)). The 50 ml conical tubes were placed in a
rotisserie oven (Robbins Scientific) for 2 hours at 65°C. Twenty-five ng of each cDNA
probe was denatured and added into individual 50 ml conical tubes containing the nylon
membrane and hybridization solution. The hybridization was performed overnight at 65°C.
The filters were washed for 20 minutes at 65°C in each of the following solutions: 3X SSPE,
0.1% SDS; 1X SSPE, 0.1% SDS; and 0.1X SSPE, 0.1% SDS.

The membranes were removed from the 50 ml conical tubes and placed in a dish.
Acetate sheets were placed between each membrane to prevent them from sticking to each
other. The incubation of the membranes with the Anti-DIG-AP and CDP-Star was performed
according to the manufacturer's recommendations (Boehringer Mannheim). The membranes were
wrapped in Saran wrap and exposed to Kodak Bio-Max X-ray film for 1 hour.

12. cDNA Cloning and Expression Analysis

To characterize the expression of the genes identified by direct cDNA selection and
genomic DNA sequencing in comparison to the publicly available databases, a series of
experiments were performed to further characterize the genes in the HBM region. First,
oligonucleotide primers were designed for use in the polymerase chain reaction (PCR) so that
portions of a cDNA, EST, or genomic DNA could be amplified from a pool of DNA
molecules (a cDNA library) or RNA population (RT-PCR and RACE). The PCR primers
were used in a reaction containing genomic DNA to verify that they generated a product of
the size predicted based on the genomic (BAC) sequence. A number of cDNA libraries were
then examined for the presence of the specific cDNA or EST. The presence of a fragment of
a transcription unit in a particular cDNA library indicates a high probability that additional
portions of the same transcription unit will be present as well.

A critical piece of data that is required when characterizing novel genes is the length,
in nucleotides, of the processed transcript or messenger RNA (mRNA). One skilled in the art
primarily determines the length of an mRNA by Northern blot hybridization (Sambrook et
al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring
Harbor NY (1989)). Groups of ESTs and direct-selected cDNA clones that displayed
significant sequence similarity to sequenced BACs in the critical region were grouped for
convenience into approximately 30 kilobase units. Within each 30 kilobase unit there were
from one up to fifty ESTs and direct-selected cDNA clones which comprised one or more
independent transcription units. One or more ESTs or direct-selected cDNAs were used as
hybridization probes to determine the length of the mRNA in a variety of tissues, using
commercially available reagents (Multiple Tissue Northern blot; Clontech, Palo Alto,
California) under conditions recommended by the manufacturer.
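
Grouping ESTs and direct-selected cDNA clones into units of roughly 30 kilobases can be pictured as binning each clone by the coordinate at which it aligns to the sequenced region. The sketch below is a hypothetical illustration: the input format (clone name plus start coordinate) and the use of fixed, non-overlapping bins are assumptions, since the text does not describe how the grouping was carried out.

```python
from collections import defaultdict

UNIT_BP = 30_000  # approximate unit size given in the text


def group_by_unit(hits):
    """Group (clone_name, start_bp) hits into consecutive 30 kb units.

    `hits` is a hypothetical list of clone names with the BAC coordinate at
    which each clone aligns; the text does not describe the data format used.
    """
    units = defaultdict(list)
    for name, start_bp in hits:
        units[start_bp // UNIT_BP].append(name)
    return dict(sorted(units.items()))


if __name__ == "__main__":
    example = [("est1", 100), ("cdna7", 29_999), ("est2", 30_000), ("est3", 95_000)]
    print(group_by_unit(example))
```

One or more clones from each resulting unit would then serve as the Northern blot probes described above.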

Directionally cloned cDNA libraries from femoral bone and calvarial bone tissue
were constructed by methods familiar to one skilled in the art (for example, Soares in
Automated DNA Sequencing and Analysis, Adams, Fields and Venter, Eds., Academic Press,
NY, pages 110-114 (1994)). Bones were initially broken into fragments with a hammer, and
the small pieces were frozen in liquid nitrogen and reduced to a powder in a tissue pulverizer
(Spectrum Laboratory Products). RNA was extracted from the powdered bone by
homogenizing the powdered bone with a standard Acid Guanidinium
Thiocyanate-Phenol-Chloroform extraction buffer (e.g., Chomczynski et al., 1987 Anal.
Biochem. 162: 156-9) using a polytron homogenizer (Brinkman Instruments). Additionally,
human brain and lung total RNA was purchased from Clontech. PolyA RNA was isolated
from total RNA using dynabeads-dT according to the manufacturer's recommendations
(Dynal, Inc.). First strand cDNA synthesis was initiated using an oligonucleotide primer with
the sequence: 5'-AACTGGAAGAATTCGCGGCCGCAGGAATTTTTTTTTTTTTTTTTT-3'
(SEQ ID NO:27). This primer introduces a NotI restriction site (GCGGCCGC) at the 3'
end of the cDNA. First and second strand synthesis were performed using the "one-tube"
cDNA synthesis kit as described by the manufacturer (Life Technologies, Bethesda, MD).
Double stranded cDNAs were treated with T4 polynucleotide kinase to ensure that the ends
of the molecules were blunt (Soares in Automated DNA Sequencing and Analysis, Adams,
Fields and Venter, Eds., Academic Press, NY, pages 110-114 (1994)), and the blunt ended
cDNAs were then size selected by a Biogel column (Huynh et al. in DNA Cloning, Vol. 1,
Glover, Ed., IRL Press, Oxford, pages 49-78 (1985)) or with a size-sep 400 Sepharose
column (Pharmacia, catalog # 27-5105-01). Only cDNAs of 400 base pairs or longer were
used in subsequent steps. EcoRI adapters (sequence: 5' OH-AATTCGGCACGAG-OH 3'
(SEQ ID NO:28), and 5' p-CTCGTGCCG-OH 3' (SEQ ID NO:29)) were then ligated to the
double stranded cDNAs by methods familiar to one skilled in the art (Soares, 1994). The
EcoRI adapters were then removed from the 3' end of the cDNA by digestion with NotI
(Soares, 1994). The cDNA was then ligated into the plasmid vector pBluescript® II KS+
(Stratagene, La Jolla, California), and the ligated material was transformed into E. coli host
DH10B or DH12S by electroporation methods familiar to one skilled in the art (Soares,
1994). After growth overnight at 37°C, DNA was recovered from the E. coli colonies after
scraping the plates by processing as directed for the Mega-prep kit (Qiagen, Chatsworth,
California). The quality of the cDNA libraries was estimated by counting a portion of the
total numbers of primary transformants and determining the average insert size and the
percentage of plasmids with no cDNA insert. Additional cDNA libraries (human total brain,
heart, kidney, leukocyte, and fetal brain) were purchased from Life Technologies, Bethesda,
MD.

cDNA libraries, both oligo(dT) and random hexamer (N6) primed, were used for
isolating cDNA clones transcribed within the HBM region: human bone, human brain, human
kidney and human skeletal muscle (all cDNA libraries were made by the inventors, except for
skeletal muscle (dT) and kidney (dT) cDNA libraries). Four 10 x 10 arrays of each of the
cDNA libraries were prepared as follows: the cDNA libraries were titered to 2.5 x 10^6 using
primary transformants. The appropriate volume of frozen stock was used to inoculate 2 L of
LB/ampicillin (100 mg/ml). This inoculated liquid culture was aliquoted into 400 tubes of 4
ml each. Each tube contained approximately 5000 cfu. The tubes were incubated at 30°C
overnight with gentle agitation. The cultures were grown to an OD of 0.7-0.9. Frozen stocks
were prepared for each of the cultures by aliquoting 100 µl of culture and 300 µl of 80%
glycerol. Stocks were frozen in a dry ice/ethanol bath and stored at -70°C. DNA was
prepared from the remaining culture using the Qiagen (Chatsworth, CA) spin miniprep kit
according to the manufacturer's instructions. The DNAs from the 400 cultures were pooled to
make 80 column and row pools. The cDNA libraries were determined to contain HBM cDNA
clones of interest by PCR. Markers were designed to amplify putative exons. Once a standard PCR
optimization was performed and specific cDNA libraries were determined to contain cDNA
clones of interest, the markers were used to screen the arrayed library. Positive addresses
indicating the presence of cDNA clones were confirmed by a second PCR using the same
markers.
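
The row and column pooling scheme lets 400 cultures be screened with only 80 PCR reactions: each culture sits in exactly one row pool and one column pool, so a positive address is the intersection of a positive row and a positive column within the same array. The sketch below illustrates this under the assumption that pools are formed per array (10 rows plus 10 columns for each of four arrays); the tuple-based pool labels are hypothetical.

```python
N_ARRAYS, SIDE = 4, 10


def pools_for_tube(array: int, row: int, col: int):
    """Return the row pool and column pool that contain a given tube."""
    return (array, "row", row), (array, "col", col)


def all_pools():
    """Every pool: 10 row pools plus 10 column pools per array."""
    return [
        (a, axis, i)
        for a in range(N_ARRAYS)
        for axis in ("row", "col")
        for i in range(SIDE)
    ]


def positive_addresses(positive_pools):
    """Tubes whose row pool and column pool both gave a PCR product."""
    positive = set(positive_pools)
    return [
        (a, r, c)
        for a in range(N_ARRAYS)
        for r in range(SIDE)
        for c in range(SIDE)
        if (a, "row", r) in positive and (a, "col", c) in positive
    ]


if __name__ == "__main__":
    print(len(all_pools()), "pools")
    print(positive_addresses(pools_for_tube(2, 3, 7)))
```

When two or more tubes in the same array are positive, the intersections become ambiguous (two rows and two columns give four candidate addresses), which is one reason a confirming second PCR on the candidate addresses, as described in the text, is useful.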

Once a cDNA library was identified as likely to contain cDNA clones corresponding
to a specific transcript of interest from the HBM region, it was manipulated to isolate the
clone or clones containing cDNA inserts identical to the EST or direct-selected cDNA of
interest. This was accomplished by a modification of the standard "colony screening"
method (Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
Laboratory, Cold Spring Harbor NY (1989)). Specifically, twenty 150 mm LB+ampicillin
agar plates were spread with 20,000 colony forming units (cfu) of cDNA library and the
colonies allowed to grow overnight at 37°C. Colonies were transferred to nylon filters
(Hybond from Amersham, or equivalent) and duplicates prepared by pressing two filters
together essentially as described (Sambrook et al., Molecular Cloning: A Laboratory Manual,
Cold Spring Harbor Laboratory, Cold Spring Harbor NY (1989)). The "master" plate was
then incubated an additional 6-8 hours to allow the colonies to grow back. The DNA from
the bacterial colonies was then affixed to the nylon filters by treating the filters sequentially
with denaturing solution (0.5 N NaOH, 1.5 M NaCl) for two minutes and neutralization solution
(0.5 M Tris-Cl pH 8.0, 1.5 M NaCl) for two minutes (twice). The bacterial colonies were
removed from the filters by washing in a solution of 2X SSC/0.1% SDS for one minute while
rubbing with tissue paper. The filters were air dried and baked under vacuum at 80°C for 1-2
hours.

A cDNA hybridization probe was prepared by random hexamer labeling (Feinberg
and Vogelstein, Anal. Biochem., 132:6-13 (1983)) or by including gene-specific primers and
no random hexamers in the reaction (for small fragments). Specific activity was calculated
and was >5x10^8 cpm/10^8 µg of cDNA. The colony membranes were then prewashed in 10
mM Tris-Cl pH 8.0, 1 M NaCl, 1 mM EDTA, 0.1% SDS for 30 minutes at 55°C. Following
the prewash, the filters were prehybridized in >2 ml/filter of 6X SSC, 50% deionized
formamide, 2% SDS, 5X Denhardt's solution, and 100 mg/ml denatured salmon sperm DNA,
at 42°C for 30 minutes. The filters were then transferred to hybridization solution (6X SSC,
2% SDS, 5X Denhardt's, 100 mg/ml denatured salmon sperm DNA) containing denatured
α-32P-dCTP-labeled cDNA probe and incubated at 42°C for 16-18 hours.

After the 16-18 hour incubation, the filters were washed under constant agitation in 
2X SSC, 2% SDS at room temperature for 20 minutes, followed by two washes at 65 °C for 
15 minutes each. A second wash was performed in 0.5 X SSC, 0.5% SDS for 15 minutes at 
65 ° C. Filters were then wrapped in plastic wrap and exposed to radiographic film for several 
hours to overnight. After film development, individual colonies on plates were aligned with 
the autoradiograph so that they could be picked into a 1 ml solution of LB Broth containing 
ampicillin. After shaking at 37 °C for 1-2 hours, aliquots of the solution were plated on 150 
mm plates for secondary screening. Secondary screening was identical to primary screening 
(above) except that it was performed on plates containing ~250 colonies so that individual
colonies could be clearly identified for picking. 

After colony screening with radiolabeled probes yielded cDNA clones, the clones 
were characterized by restriction endonuclease cleavage, PCR, and direct sequencing to 
confirm the sequence identity between the original probe and the isolated clone. To obtain 
the full-length cDNA, the novel sequence from the end of the clone identified was used to 
probe the library again. This process was repeated until the length of the cloned cDNA
matched that estimated to be full-length by the northern blot analysis.

RT-PCR was used as another method to isolate full-length clones. The cDNA was
synthesized and amplified using a "Superscript One Step RT-PCR" kit (Life Technologies,
Gaithersburg, MD). The procedure involved adding 1.5 µg of RNA to the following: 25 µl of
the reaction mix provided (a proprietary buffer mix with MgSO4 and dNTPs), 1 µl sense
primer (10 µM), 1 µl anti-sense primer (10 µM), 1 µl of the reverse transcriptase and Taq DNA
polymerase mix provided, and autoclaved water to a total reaction volume of 50 µl. The reaction
was then placed in a thermocycler for 1 cycle at 50 °C for 15 to 30 minutes, then 94 °C for 15
seconds, 55-60 °C for 30 seconds and 68-72 °C for 1 minute per kilobase of anticipated
product, and finally 1 cycle of 72 °C for 5-10 minutes. The sample was analyzed on an
agarose gel. The product was excised from the gel and purified (GeneClean, Bio
101). The purified product was cloned in pCTNR (General Contractor DNA Cloning
System, 5 Prime - 3 Prime, Inc.) and sequenced to verify that the clone was specific to the
gene of interest.
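
As a rough illustration of the cycling scheme described above, the following Python sketch builds a thermocycler program in which the extension time scales at 1 minute per kilobase of anticipated product. The text does not state the number of amplification cycles, so `cycles` is a required, hypothetical parameter. The function and argument names are illustrative only, and the default temperatures and times are simply values chosen from within the stated ranges.

```python
def rt_pcr_program(product_kb, cycles, anneal_c=55, extend_c=68,
                   rt_minutes=30, final_minutes=10):
    """Return the program as a list of (step, temperature_C, seconds).

    `cycles` is not given in the text and must be supplied by the user.
    """
    if not 55 <= anneal_c <= 60:
        raise ValueError("annealing temperature outside the 55-60 C range")
    if not 68 <= extend_c <= 72:
        raise ValueError("extension temperature outside the 68-72 C range")

    # Extension time: 1 minute per kilobase of anticipated product.
    extension_s = round(60 * product_kb)

    program = [("reverse transcription", 50, rt_minutes * 60)]
    for _ in range(cycles):
        program.append(("denature", 94, 15))
        program.append(("anneal", anneal_c, 30))
        program.append(("extend", extend_c, extension_s))
    program.append(("final extension", 72, final_minutes * 60))
    return program


def total_minutes(program):
    """Total hold time of the program, ignoring ramping between steps."""
    return sum(seconds for _, _, seconds in program) / 60


if __name__ == "__main__":
    prog = rt_pcr_program(product_kb=2.0, cycles=35)
    print(len(prog), "steps,", total_minutes(prog), "minutes")
```

Keeping the program as plain data makes it easy to check the extension time for a given product size before committing a sample to the run.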

Rapid Amplification of cDNA Ends (RACE) was performed following the
manufacturer's instructions using a Marathon cDNA Amplification Kit (Clontech, Palo Alto,
CA) as a method for cloning the 5' and 3' ends of candidate genes. cDNA pools were
prepared from total RNA by performing first strand synthesis, in which a sample of total RNA
was mixed with a modified oligo (dT) primer, heated to 70 °C and cooled on ice,
followed by the addition of 5X first strand buffer, 10 mM dNTP mix, and AMV Reverse
Transcriptase (20 U/µl). The tube was incubated at 42 °C for one hour and then
placed on ice. For second strand synthesis, the following components were added
directly to the reaction tube: 5X second strand buffer, 10 mM dNTP mix, sterile water, and 20X
second strand enzyme cocktail. The reaction tube was incubated at 16 °C for 1.5 hours. T4
DNA Polymerase was added to the reaction tube, which was incubated at 16 °C for 45 minutes. The
second-strand synthesis was terminated with the addition of an EDTA/Glycogen mix. The
sample was subjected to a phenol/chloroform extraction and an ammonium acetate
precipitation. The cDNA pools were checked for quality by analyzing on an agarose gel for
size distribution. Marathon cDNA adapters (Clontech) were then ligated onto the cDNA
ends. The specific adapters contained priming sites that allowed for amplification of either 5'
or 3' ends, depending on the orientation of the gene specific primer (GSP) that was chosen.
An aliquot of the double stranded cDNA was added to the following reagents: 10 µM
Marathon cDNA adapter, 5X DNA ligation buffer, T4 DNA ligase. The reaction was
incubated at 16 °C overnight and then heat inactivated.
PCR was performed by the addition of the following to the diluted double stranded cDNA
pool: 10X cDNA PCR reaction buffer, 10 µM dNTP mix, 10 µM GSP, 10 µM AP1 primer
(kit), 50X Advantage cDNA Polymerase Mix. Thermal cycling conditions were 94 °C for 30
seconds; 5 cycles of 94 °C for 5 seconds, 72 °C for 4 minutes; 5 cycles of 94 °C for 5 seconds,
70 °C for 4 minutes; and 23 cycles of 94 °C for 5 seconds, 68 °C for 4 minutes. After the first
round of PCR was performed using the GSP to extend to the end of the adapter to create the
adapter primer binding site, exponential amplification of the specific cDNA of interest was
observed. Usually a second nested PCR is performed to confirm the specific cDNA. The
RACE product was analyzed on an agarose gel and then excised and purified from the gel
(GeneClean, BIO 101). The RACE product was then cloned into pCTNR (General
Contractor DNA Cloning System, 5 Prime - 3 Prime, Inc.) and the DNA sequence determined to verify
that the clone was specific to the gene of interest.

13. Mutation Analysis 

Comparative genes were identified using the above procedures and the exons from 
each gene were subjected to mutation detection analysis. Comparative DNA sequencing was 
used to identify polymorphisms in HBM candidate genes from chromosome 11q12-13. DNA
sequences for candidate genes were amplified from patient lymphoblastoid cell lines. 

The inventors developed a method based on analysis of direct DNA sequencing of 
PCR products amplified from candidate regions to search for the causative polymorphism. 
The procedure consisted of three stages that used different subsets of the HBM family to find
segregating polymorphisms and a population panel to assess the frequency of the 
polymorphisms. The family resources result from a single founder leading to the assumption 
that all affected individuals will share the same causative polymorphism. 

Candidate regions were first screened in a subset of the HBM family consisting of the 
proband, daughter, and her mother, father and brother. Monochromosomal reference 
sequences were produced concurrently and used for comparison. The mother and daughter 
carried the HBM polymorphism in this nuclear family, providing the ability to monitor 
polymorphism transmission. The net result is that two HBM chromosomes and six non- 
HBM chromosomes were screened. This allowed exclusion of numerous frequent alleles. 
Only alleles exclusively present in the affected individuals passed to the next level of 
analysis.

Polymorphisms that segregated exclusively with the HBM phenotype in this original
family were then re-examined in an extended portion of the HBM pedigree consisting of two
additional nuclear families. These families consisted of five HBM and three unaffected
individuals. The HBM individuals in this group included the two critical crossover
individuals, providing the centromeric and telomeric boundaries of the critical region.
Tracking the heredity of polymorphisms between these individuals and their affected parents
allowed for further refining of the critical region. This group brought the total of HBM
chromosomes screened to seven and the total of non-HBM chromosomes to seventeen.

When a given polymorphism continued to segregate exclusively with the HBM
phenotype in the extended group, a population panel was then examined. This panel of 84
persons consisted of 42 individuals known to have normal bone mineral density and 42
individuals known to be unrelated but with untyped bone mineral density. For this purpose,
normal bone mineral density is within two standard deviations of a BMD Z score of 0. The
second group was from the widely used CEPH panel of individuals. Any segregating
polymorphisms found to be rare in this population were subsequently examined on the entire
HBM pedigree and a larger population.
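
The chromosome counts quoted for the first two screening stages can be reproduced with a short calculation. The Python sketch below assumes that every affected individual is heterozygous (one HBM and one non-HBM chromosome), as reported elsewhere in this section, and that the first nuclear family comprised two affected and two unaffected individuals. The function names are illustrative.

```python
def chromosomes_screened(n_affected, n_unaffected):
    """Return (HBM, non-HBM) chromosome counts for one group.

    Assumes each affected individual is heterozygous: one HBM chromosome
    and one non-HBM chromosome. Unaffected individuals carry two non-HBM
    chromosomes.
    """
    hbm = n_affected
    non_hbm = n_affected + 2 * n_unaffected
    return hbm, non_hbm


def cumulative_totals(groups):
    """Sum the (HBM, non-HBM) counts over (affected, unaffected) groups."""
    total_hbm = total_non_hbm = 0
    for n_affected, n_unaffected in groups:
        hbm, non_hbm = chromosomes_screened(n_affected, n_unaffected)
        total_hbm += hbm
        total_non_hbm += non_hbm
    return total_hbm, total_non_hbm


if __name__ == "__main__":
    nuclear_family = (2, 2)      # two affected, two unaffected
    extended_pedigree = (5, 3)   # five HBM, three unaffected
    print(chromosomes_screened(*nuclear_family))
    print(cumulative_totals([nuclear_family, extended_pedigree]))
```

Under these assumptions the first stage gives two HBM and six non-HBM chromosomes, and the two stages together give seven and seventeen, matching the totals stated in the text.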

Polymerase chain reaction (PCR) was used to generate sequencing templates from the
HBM family's DNA and monochromosomal controls. Enzymatic amplification of genes
within the HBM region on 11q12-13 was accomplished using the PCR with oligonucleotides
flanking each exon as well as the putative 5' regulatory elements of each gene. The primers
were chosen to amplify each exon as well as 15 or more base pairs within each intron on
either side of the splice. All PCR primers were made as chimeras to facilitate dye primer
sequencing. The M13-21F (5'- GTA A CGA CGG CCA GT -3') (SEQ ID NO:30) and
-28REV (5'- AAC AGC TAT GAC CAT G -3') (SEQ ID NO:31) primer binding sites were
built on to the 5' end of each forward and reverse PCR primer, respectively, during synthesis.
150 ng of genomic DNA was used in a 50 µl PCR with 2 U AmpliTaq, 500 nM primer and
125 µM dNTP. Buffer and cycling conditions were specific to each primer set. TaqStart
antibody (Clontech) was used for hot start PCR to minimize primer dimer formation. 10% of
the product was examined on an agarose gel. The appropriate samples were diluted 1:25 with
deionized water before sequencing.

Each PCR product was sequenced according to the standard Energy Transfer primer
(Amersham) protocol. All reactions took place in 96 well trays. Four separate reactions, one
each for A, C, G and T, were performed for each template. Each reaction included 2 µl of the
sequencing reaction mix and 3 µl of diluted template. The plates were then heat sealed with
foil tape and placed in a thermal cycler and cycled according to the manufacturer's
recommendation. After cycling, the 4 reactions were pooled. 3 µl of the pooled product was
transferred to a new 96 well plate and 1 µl of the manufacturer's loading dye was added to
each well. All 96 well pipetting procedures occurred on a Hydra 96 pipetting station
(Robbins Scientific, USA). 1 µl of pooled material was directly loaded onto a 48 lane gel
running on an ABI 377 DNA sequencer for a 10 hour, 2.4 kV run.
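
The chimeric primer design described above, in which a universal sequencing-primer binding site is built onto the 5' end of each gene-specific PCR primer, can be sketched in Python as follows. The reverse tail is the -28REV sequence as printed in the text. The forward tail printed in the text appears incomplete, so the sketch substitutes the commonly used M13 -21 forward sequence as an explicit assumption. The gene-specific primers in the usage example are 1863F and 1864R from the ASO section and are used only for illustration; the text does not say they were tailed.

```python
# Reverse tail: -28REV binding site as printed in the text (SEQ ID NO:31).
M13_REV_TAIL = "AACAGCTATGACCATG"

# Forward tail: ASSUMPTION. The commonly used M13 -21 forward sequence,
# substituted because the sequence printed in the text appears incomplete.
M13_FWD_TAIL = "TGTAAAACGACGGCCAGT"


def make_chimeric_primers(forward_primer, reverse_primer,
                          fwd_tail=M13_FWD_TAIL, rev_tail=M13_REV_TAIL):
    """Prepend the universal tails to the 5' ends of a primer pair.

    All sequences are written 5' to 3', so building the tail onto the
    5' end is plain string concatenation with the tail first.
    """
    for seq in (forward_primer, reverse_primer, fwd_tail, rev_tail):
        if set(seq.upper()) - set("ACGT"):
            raise ValueError(f"unexpected character in sequence: {seq}")
    return (fwd_tail + forward_primer.upper(),
            rev_tail + reverse_primer.upper())


if __name__ == "__main__":
    fwd, rev = make_chimeric_primers("CCAAGTTCTGAGAAGTCC",
                                     "AATACCTGAAACCATACCTG")
    print("5'-" + fwd + "-3'")
    print("5'-" + rev + "-3'")
```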

Polyphred (University of Washington) was used to assemble sequence sets for 
viewing with Consed (University of Washington). Sequences were assembled in groups 
representing all relevant family members and controls for a specified target region. This was 
done separately for each of the three stages. Forward and reverse reads were included for 
each individual along with reads from the monochromosomal templates and a color annotated 
reference sequence. Polyphred indicated potential polymorphic sites with a purple flag. Two 
readers independently viewed each assembly and assessed the validity of the purple-flagged 
sites. 

A total of 23 exons present in the mature mRNA and several other portions of the 
primary transcript were evaluated for heterozygosity in the nuclear family of two HBM- 
affected and two unaffected individuals. 25 SNPs were identified, as shown in the table 
below. 

TABLE 8

Single Nucleotide Polymorphisms in the Zmax1 (LRP5) Gene and Environs

Exon Name                  Location        Base Change
b200e21-h_Contig1_1.nt     69169 (309G)    C/A
b200e21-h_Contig4_12.nt    27402 (309G)    A/G
b200e21-h_Contig4_13.nt    27841 (309G)    T/C
b200e21-h_Contig4_16.nt    35600 (309G)    A/G
b200e21-h_Contig4_21.nt    45619 (309G)    G/A

In addition to the polymorphisms presented in Table 8, two additional polymorphisms can
also be present in SEQ ID NO:2. The first is a change at position 2002 of SEQ ID NO:2. Either
a guanine or an adenine can appear at this position. This polymorphism is silent and is not
associated with any change in the amino acid sequence. The second change is at position
4059 of SEQ ID NO:2, corresponding to a cytosine (C) to thymine (T) change. This
polymorphism results in a corresponding amino acid change from a valine (V) to an alanine
(A). Other polymorphisms were found in the candidate gene exons and adjacent intron
sequences as displayed in Tables 2 and 3. Any one or combination of the polymorphisms
listed in Tables 2, 3 or 8 or the two discussed above could also have a minor effect on bone
mass when present in SEQ ID NO:2. These could also be in combination with any of the
other mutations discussed in Section 3 and in the Examples below.

The present invention encompasses the nucleic acid sequences having the nucleic acid
sequence of SEQ ID NO: 1 with any one or more of the above-identified point mutations.

Preferably, the present invention encompasses the nucleic acid of SEQ ID NO: 2 or
a mutation in SEQ ID NO: 1 that produces a phenotype like that observed with the protein
encoded by SEQ ID NO:2 (HBM phenotype). Specifically, a base-pair substitution changing
G to T at position 582 in the coding sequence of Zmax1 (the HBM gene) was identified as
heterozygous in all HBM individuals, and not found in the unaffected individuals (i.e.,
b527d12-h_Contig087C_1.nt). Fig. 6 shows the order of the contigs in B527D12. The
direction of transcription for the HBM gene is from left to right. The sequence of contig308G
of B527D12 is the reverse complement of the coding region of the HBM gene. Therefore, the
relative polymorphism in contig 308G shown in Table 8 as a base change substitution of C to
A is the complement to the G to T substitution in the HBM gene. This mutation causes a
substitution of glycine 171 with valine (G171V).
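
The strand relationship described above can be checked mechanically: a G to T change on the coding strand reads as a C to A change on the reverse-complement strand. The Python sketch below also confirms, from the standard genetic code, that a G to T change at the second position of any glycine codon yields a valine codon. The six-base sequences in the usage example are hypothetical and are not taken from the gene.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

# Standard genetic code entries for the two amino acids involved.
GLYCINE_CODONS = {"GGA", "GGC", "GGG", "GGT"}
VALINE_CODONS = {"GTA", "GTC", "GTG", "GTT"}


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def opposite_strand_change(ref, alt):
    """Express a substitution ref -> alt as seen on the opposite strand."""
    return COMPLEMENT[ref], COMPLEMENT[alt]


def mutate_second_position(codon, new_base="T"):
    """Replace the middle base of a codon."""
    return codon[0] + new_base + codon[2]


if __name__ == "__main__":
    print(opposite_strand_change("G", "T"))
    # Hypothetical six-base window, wild type and variant (coding strand).
    wild_type, variant = "ACGGTA", "ACGTTA"
    print(reverse_complement(wild_type), reverse_complement(variant))
    print(sorted(mutate_second_position(c) for c in GLYCINE_CODONS))
```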

The HBM polymorphism was confirmed by examining the DNA sequence of different
groups of individuals. In all members of the HBM pedigree (38 individuals), the HBM
polymorphism was observed in the heterozygous form in affected (i.e., elevated bone mass)
individuals only (N=18). In unaffected relatives (N=20) (BMD Z<2.0), the HBM
polymorphism was never observed. To determine whether this polymorphism was ever
observed in individuals outside of the HBM pedigree, 297 phenotyped individuals were
characterized at the site of the HBM gene. None were heterozygous at the site of the HBM
polymorphism. In an unphenotyped control group, none of 64 individuals were observed to
be heterozygous at position 582. Taken together, these data demonstrate that the high bone
mass phenotype observed in the kindred is strongly correlated with
the G-to-T polymorphism at position 582 of Zmax1 (LRP5). Furthermore, these results,
coupled with the ASO results described below, establish that the HBM polymorphism
genetically segregates with the HBM phenotype, and that both the HBM polymorphism and
phenotype are rare in the general population.

13. Allele Specific Oligonucleotide (ASO) Analysis

The amplicon containing the HBM polymorphism was PCR amplified using primers
specific for the exon of interest. The appropriate population of individuals was PCR
amplified in 96 well microtiter plates as follows. PCR reactions (20 µl) containing 1X
Promega PCR buffer (Cat. # M1883, containing 1.5 mM MgCl2), 100 mM dNTP, 200 nM
PCR primers (1863F: 5'-CCAAGTTCTGAGAAGTCC-3' and 1864R:
5'-AATACCTGAAACCATACCTG-3'; SEQ ID NOS: 629 and 630, respectively), 1 U
AmpliTaq™, and 20 ng of genomic DNA were prepared and amplified under the following
PCR conditions: 94 °C, 1 minute; (94 °C, 30 sec; 58 °C, 30 sec; 72 °C, 1 min) x 35 cycles;
72 °C, 5 min; 4 °C, hold. Loading dye was then added and 10 µl of the products was
electrophoresed on 1.5% agarose gels containing 1 µg/ml ethidium bromide at 100-150 V for
5-10 minutes. Gels were treated for 20 minutes in denaturing solution (1.5 M NaCl, 0.5 N
NaOH) and rinsed briefly with water. Gels were then neutralized in 1 M Tris-HCl, pH 7.5,
1.5 M NaCl, for 20 minutes and rinsed with water. Gels were soaked in 10X SSC for 20
minutes and blotted onto nylon transfer membrane (Hybond N+, Amersham) in 10X SSC
overnight. Filters were then rinsed in 6X SSC for 10 minutes and UV crosslinked.

The allele specific oligonucleotides (ASO) were designed with the polymorphism 
approximately in the middle. Oligonucleotides were phosphate free at the 5' end and were 
purchased from Gibco BRL. Sequences of the oligonucleotides are: 

2326 Zmax1.ASO.g: 5'-AGACTGGGGTGAGACGC-3' (SEQ ID NO:631)

2327 Zmax1.ASO.t: 5'-CAGACTGGGTTGAGACGCC-3' (SEQ ID NO:632)

The polymorphic nucleotide is the ninth base of 2326 Zmax1.ASO.g (G) and the tenth base
of 2327 Zmax1.ASO.t (T). To label the oligos, 1.5 µl of 1 µg/µl
ASO oligo (2326.Zmax1.ASO.g or 2327.Zmax1.ASO.t), 11 µl ddH2O, 2 µl 10X kinase
forward buffer, 5 µl γ-32P-ATP (6000 Ci/mMole), and 1 µl T4 polynucleotide kinase (10 U/µl)
were mixed, and the reaction incubated at 37 °C for 30-60 minutes. Reactions were then
placed at 95 °C for 2 minutes and 30 µl H2O was added. The probes were purified using a
G25 microspin column (Pharmacia).

Blots were prehybridized in 10 ml 5X SSPE, 5X Denhardt's, 2% SDS, and 100
µg/ml denatured, sonicated salmon sperm DNA at 40 °C for 2 hr. The entire reaction mix of
kinased oligo was then added to 10 ml fresh hybridization buffer (5X SSPE, 5X Denhardt's,
2% SDS) and hybridized at 40 °C for at least 4 hours to overnight.

All washes were done in 5X SSPE, 0.1% SDS. The first wash was at 45 °C for 15 minutes;
the solution was then changed and the filters washed at 50 °C for 15 minutes. Filters were then
exposed to Kodak BioMax film with 2 intensifying screens at -70 °C for 15 minutes to 1 hr.
If necessary the filters were washed at 55 °C for 15 minutes and exposed to film again.
Filters were stripped by washing in boiling 0.1X SSC, 0.1% SDS for 10 minutes at least 3
times.
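
The design rule for the two ASOs, with the polymorphism placed approximately in the middle, can be checked by aligning the shorter oligo against the longer one and locating the single mismatch. The Python sketch below assumes that the t-allele oligo carries a T at the polymorphic position, as its name indicates. The function name is illustrative.

```python
ASO_G = "AGACTGGGGTGAGACGC"     # 2326 Zmax1.ASO.g (SEQ ID NO:631)
ASO_T = "CAGACTGGGTTGAGACGCC"   # 2327 Zmax1.ASO.t (SEQ ID NO:632), T assumed


def best_ungapped_alignment(short, long_):
    """Slide `short` along `long_` without gaps.

    Returns (offset, mismatch_positions) for the offset with the fewest
    mismatches. Positions are 0-based indices into `short`.
    """
    best = None
    for offset in range(len(long_) - len(short) + 1):
        window = long_[offset:offset + len(short)]
        mismatches = [i for i, (a, b) in enumerate(zip(short, window))
                      if a != b]
        if best is None or len(mismatches) < len(best[1]):
            best = (offset, mismatches)
    return best


if __name__ == "__main__":
    offset, mismatches = best_ungapped_alignment(ASO_G, ASO_T)
    for i in mismatches:
        print(f"offset {offset}: base {i + 1} of {len(ASO_G)} differs, "
              f"{ASO_G[i]} vs {ASO_T[offset + i]}")
```

With the sequences as given, the two oligos differ at a single base, the ninth of the 17-mer, which is its central position.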

The two films that best captured the allele specific assay with the 2 ASOs were 
converted into digital images by scanning them into Adobe PhotoShop. These images were 
overlaid against each other in Graphic Converter and then scored and stored in FileMaker Pro 
4.0 (see Fig. 9). 

In order to determine the HBM allele frequency in ethnically diverse populations, 672
random individuals from various ethnic groups were typed by the allele specific
oligonucleotide (ASO) method. This population included 96 CEPH grandparents (primarily
Caucasian), 192 Caucasian, 192 African-American, 96 Hispanic, and 96 Asian individuals.
No evidence was obtained for the presence of the HBM polymorphism in any of these
individuals. Overall, a total of 911 individuals were typed either by direct sequencing or
ASO hybridization; all were homozygous GG at the site of the HBM polymorphism (Fig.
14). This information illustrates that the HBM allele is rare in various ethnic populations.
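
To put a number on "rare", one can ask what allele frequency is still compatible with observing no HBM allele among the chromosomes typed. The Python sketch below is an illustration only and is not part of the analysis reported here. It assumes the typed chromosomes can be treated as independent random draws from one population, which is a simplification, and it uses the exact binomial bound for zero observed events.

```python
def max_allele_frequency(n_individuals, confidence=0.95):
    """Upper confidence bound on allele frequency after zero observations.

    With n diploid individuals there are 2n chromosomes. If the true
    frequency were p, the chance of seeing no copies is (1 - p) ** (2n).
    Solving (1 - p) ** (2n) = 1 - confidence for p gives the bound.
    """
    n_chromosomes = 2 * n_individuals
    return 1 - (1 - confidence) ** (1 / n_chromosomes)


if __name__ == "__main__":
    aso_groups = {"CEPH grandparents": 96, "Caucasian": 192,
                  "African-American": 192, "Hispanic": 96, "Asian": 96}
    print("ASO-typed individuals:", sum(aso_groups.values()))
    print("Upper bound, 911 individuals:",
          round(max_allele_frequency(911), 5))
```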

Thus this invention provides a rapid method of identifying individuals with the HBM
allele. This method could be used in the area of diagnostics and screening of an individual
for susceptibility to osteoporosis or other bone disorders. The assay could also be used to
identify additional individuals with the HBM allele or the additional polymorphisms
described herein.

14. Cellular Localization of Zmax1 (LRP5)

14.1 Gene Expression in Rat Tibia by Non-Isotopic In Situ Hybridization

In situ hybridization was conducted by Pathology Associates International (PAI),
Frederick, MD. This study was undertaken to determine the specific cell types that express
the Zmax1 (LRP5) gene in rat bone with particular emphasis on areas of bone growth and
remodeling. Zmax1 (LRP5) probes used in this study were generated from both human
(HuZmax1) and mouse (MsZmax1) cDNAs, which share an 87% sequence identity. The
homology of human and mouse Zmax1 (LRP5) with rat Zmax1 (LRP5) is unknown.

For example, gene expression by non-isotopic in situ hybridization was performed as
follows, but other methods would be known to the skilled artisan. Tibias were collected from
two 6 to 8 week old female Sprague Dawley rats euthanized by carbon dioxide asphyxiation.
Distal ends were removed and proximal tibias were snap frozen in OCT embedding medium
with liquid nitrogen immediately following death. Tissues were stored in a -80 °C freezer.

Probes for amplifying PCR products from cDNA were prepared as follows. The
primers to amplify PCR products from a cDNA clone were chosen using published sequences
of both human LRP5 (GenBank Accession No. AB017498) and mouse LRP5 (GenBank
Accession No. AF064984). In order to minimize cross reactivity with other genes in the LDL
receptor family, the PCR products were derived from an intracellular portion of the protein
coding region. PCR was performed in a 50 µl reaction volume using a cDNA clone as
template. PCR reactions contained 1.5 mM MgCl2, 1 unit AmpliTaq, 200 µM dNTPs and 2
µM each primer. PCR cycling conditions were 94 °C for 1 min., followed by 35 cycles of
94 °C for 30 seconds, 55 °C for 30 seconds, 72 °C for 30 seconds; followed by a 5 minute
extension at 72 °C. The reactions were then run on a 1.5% agarose Tris-Acetate gel. DNA
was eluted from the agarose, ethanol precipitated and resuspended in 10 mM Tris, pH 8.0.
Gel purified PCR products were prepared for both mouse and human cDNAs and supplied to
Pathology Associates International for in situ hybridizations.

The sequences of the human and mouse PCR primers and products were as follows:

Human Zmax1 (LRP5) sense primer

5'-CCCGTGTGCTCCGCCGCCCAGTTC-3' (SEQ ID NO:633)

Human Zmax1 (LRP5) antisense primer (HBM465)

5'-GGCTCACGGAGCTCATCATGGACTT-3' (SEQ ID NO:634)

Human Zmax1 PCR product

5'-CCCGTGTGCTCCGCCGCCCAGTTCCCCTGCGCGCGGGGTCAGTGTGTGGACCT
GCGCCTGCGCTGCGACGGCGAGGCAGACTGTCAGGACCGCTCAGACGAGGTGG
ACTGTGACGCCATCTGCCTGCCCAACCAGTTCCGGTGTGCGAGCGGCCAGTGTGT
CCTCATCAAACAGCAGTGCGACTCCTTCCCCGACTGTATCGACGGCTCCGACGAG
CTCATGTGTGAAATCACCAAGCCGCCCTCAGACGACAGCCCGGCCCACAGCAGT
GCCATCGGGCCCGTCATTGGCATCATCCTCTCTCTCTTCGTCATGGGTGGTGTCTA
TTTTGTGTGCCAGCGCGTGGTGTGCCAGCGCTATGCGGGGGCCAACGGGCCCTTC
CCGCACGAGTATGTCAGCGGGACCCCGCACGTGCCCCTCAATTTCATAGCCCCGG
GCGGTTCCCAGCATGGCCCCTTCACAGGCATCGCATGCGGAAAGTCCATGATGA
GCTCCGTGAGCC-3' (SEQ ID NO:635)

Mouse Zmax1 (LRP5) sense primer (HBM655)

5'-AGCGAGGCCACCATCCACAGG-3' (SEQ ID NO:636) 

Mouse Zmax1 (LRP5) antisense primer (HBM656)

5'-TCGCTGGTCGGCATAATCAAT-3' (SEQ ID NO:637)

Mouse Zmax1 (LRP5) PCR product

5'-AGCAGAGCCACCATCCACAGGATCTCCCTGGAGACTAACAACAACGATGTGG
CTATCCCACTCACGGGTGTCAAAGAGGCCTCTGCACTGGACTTTGATGTGTCCAA
CAATCACATCTACTGGACTGATGTTAGCCTCAAGACGATCAGCCGAGCCTTCATG
AATGGGAGCTCAGTGGAGCACGTGATTGAGTTTGGCCTCGACTACCCTGAAGGA
ATGGCTGTGGACTGGATGGGCAAGAACCTCTATTGGGCGGACACAGGGACCAAC
AGGATTGAGGTGGCCCGGCTGGATGGGCAGTTCCGGCAGGTGCTTGTGTGGAGA
GACCTTGACAACCCCAGGTCTCTGGCTCTGGATCCTACTAAAGGCTACATCTACT
GGACTGAGTGGGGTGGCAAGCCAAGGATTGTGCGGGCCTTCATGGATGGGACCA
ATTGTATGACACTGGTAGACAAGGTGGGCCGGGCCAACGACCTCACCATTGATT
ATGCCGACCAGCGA-3' (SEQ ID NO:638)

Riboprobes were synthesized as follows. The PCR products were reamplified with
chimeric primers designed to incorporate either a T3 promoter upstream, or a T7 promoter
downstream of the reamplification products. The resulting PCR products were used as
template to synthesize digoxigenin-labeled riboprobes by in vitro transcription (IVT).
Antisense and sense riboprobes were synthesized using T7 and T3 RNA polymerases,
respectively, in the presence of digoxigenin-11-UTP (Boehringer-Mannheim) using a
MAXIscript IVT kit (Ambion) according to the manufacturer's instructions. The DNA was then degraded
with DNase I, and unincorporated digoxigenin was removed by ultrafiltration. Riboprobe
integrity was assessed by electrophoresis through a denaturing polyacrylamide gel.
Molecular size was compared with the electrophoretic mobility of a 100-1000 base pair (bp)
RNA ladder (Ambion). Probe yield and labeling were evaluated by blot immunochemistry.
Riboprobes were stored in 5 µl aliquots at -80 °C.

The in situ hybridization was performed as follows. Frozen rat bone was cut into 5
µm sections on a Jung CM3000 cryostat (Leica) and mounted on adhesive slides
(Instrumedics). Sections were kept in the cryostat at -20 °C until all the slides were prepared
in order to prevent mRNA degradation prior to post-fixation for 15 minutes in 4%
paraformaldehyde. Following post-fixation, sections were incubated with 1 ng/µl of either
antisense or sense riboprobe in Pathology Associates International (PAI) customized
hybridization buffer for approximately 40 hours at 58 °C. Following hybridization, slides
were subjected to a series of post-hybridization stringency washes to reduce nonspecific
probe binding. Hybridization was visualized by immunohistochemistry with an
anti-digoxigenin antibody (Fab fragment) conjugated to alkaline phosphatase. Nitroblue
tetrazolium chloride/bromochloroindolyl phosphate (Boehringer-Mannheim), a precipitating
alkaline phosphatase substrate, was used as the chromogen to stain hybridizing cells purple to
nearly black, depending on the degree of staining. Tissue sections were counter-stained with
nuclear fast red. Assay controls included omission of the probe and omission of both the
probe and the anti-digoxigenin antibody.

Specific cell types were assessed for demonstration of hybridization with antisense 
probes by visualizing a purple to black cytoplasmic and/or peri-nuclear staining indicating a 
positive hybridization signal for mRNA. Each cell type was compared to the replicate 
sections, which were hybridized with the respective sense probe. Results were considered 
positive if staining was observed with the antisense probe and no staining or weak 
background with the sense probe. 
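
The scoring rule stated above can be written as a small predicate. The Python sketch below is one possible encoding: the category labels for the sense-probe signal and the function name are hypothetical, chosen only to mirror the wording of the rule.

```python
# Hypothetical labels for the signal seen with the sense (control) probe.
SENSE_SIGNALS = ("none", "weak background", "staining")


def is_positive(antisense_stained, sense_signal):
    """Apply the stated rule for one cell type.

    Positive if staining was observed with the antisense probe and the
    sense probe showed no staining or only weak background.
    """
    if sense_signal not in SENSE_SIGNALS:
        raise ValueError(f"unknown sense signal: {sense_signal!r}")
    return bool(antisense_stained) and sense_signal in ("none",
                                                        "weak background")


if __name__ == "__main__":
    # Made-up observations, for illustration only.
    observations = [(True, "none"), (True, "weak background"),
                    (True, "staining"), (False, "none")]
    for antisense, sense in observations:
        print(antisense, sense, "->", is_positive(antisense, sense))
```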

The cellular localization of the hybridization signal for each of the study probes is
summarized in Table 9. Hybridization for Zmax1 (LRP5) was primarily detected in areas of
bone involved in remodeling, including the endosteum and trabecular bone within the
metaphysis. Hybridization in selected bone lining cells of the periosteum and epiphysis was
also observed. Positive signal was also noted in chondrocytes within the growth plate,
particularly in the proliferating chondrocytes. See Figs. 10, 11 and 12 for representative
photomicrographs of in situ hybridization results.

TABLE 9

PROBE      SITE                              ISH SIGNAL

HuZmax1    Epiphysis
             Osteoblasts
             Osteoclasts
           Growth Plate
             resting chondrocytes
             proliferating chondrocytes      +
             hypertrophic chondrocytes
           Metaphysis
             osteoblasts                     +
             osteoclasts                     +
           Diaphysis
           Endosteum
             osteoblasts                     +
             osteoclasts                     +
           Periosteum

MsZmax1    Epiphysis
             Osteoblasts
             Osteoclasts
           Growth Plate
             resting chondrocytes
             proliferating chondrocytes      +
             hypertrophic chondrocytes       +
           Metaphysis
             osteoblasts                     +
             osteoclasts                     +
           Diaphysis
           Endosteum
             osteoblasts                     +
             osteoclasts                     +
           Periosteum                        +

Legend: "+" = hybridization signal detected; "-" = no hybridization signal detected;
"ISH" = in situ hybridization

These studies confirm the positional expression of Zmax1 (LRP5) in cells involved in
bone remodeling and bone formation. Zmax1 (LRP5) expression in the zone of proliferation
and in the osteoblasts and osteoclasts of the proximal metaphysis suggests that the Zmax1
(LRP5) gene is involved in the process of bone growth and mineralization. The activity and
differentiation of osteoblasts and osteoclasts are closely coordinated during development as
bone is formed and during growth as well as in adult life as bone undergoes continuous
remodeling. The formation of internal bone structures and bone remodeling result from the
coupling of bone resorption by activated osteoclasts with subsequent deposition of new
material by osteoblasts. Zmax1 (LRP5) is related to the LDL receptor gene, and thus may be
a receptor involved in mechanosensation and subsequent signaling in the process of bone
remodeling. Therefore, changes in the level of expression of this gene could affect the
rate of remodeling and degree of mineralization of bone.

15. Antisense

Antisense oligonucleotides are short synthetic nucleic acids that contain 
complementary base sequences to a targeted RNA. Hybridization of the RNA in living cells 
with the antisense oligonucleotide interferes with RNA function and ultimately blocks protein 
expression. Therefore, any gene for which the partial sequence is known can be targeted by 
an antisense oligonucleotide. 
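
Because an antisense oligonucleotide is simply the complement of a stretch of the target RNA, a candidate sequence can be derived from any known partial sequence. The Python sketch below returns a DNA oligonucleotide, written 5' to 3', that is complementary to a chosen window of an mRNA. The function names and the nine-base example sequence are hypothetical, and the sketch makes no attempt to rank candidates for accessibility or specificity.

```python
DNA_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(dna):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(DNA_COMPLEMENT[base] for base in reversed(dna))


def antisense_oligo(mrna, start, length):
    """DNA antisense oligo (5' to 3') against mrna[start:start + length].

    `mrna` may be written with U or T; `start` is a 0-based index.
    """
    target = mrna.upper().replace("U", "T")[start:start + length]
    if len(target) != length:
        raise ValueError("requested window runs past the end of the sequence")
    return reverse_complement(target)


if __name__ == "__main__":
    example_mrna = "AUGGCAUUC"   # hypothetical sequence for illustration
    print("5'-" + antisense_oligo(example_mrna, 0, 6) + "-3'")
```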

Antisense technology is becoming a widely used research tool and will play an 
increasingly important role in the validation and elucidation of therapeutic targets identified 
by genomic sequencing efforts. 

Antisense technology was developed to inhibit gene expression by utilizing an 
oligonucleotide complementary to the mRNA that encodes the target gene. There are several 
possible mechanisms for the inhibitory effects of antisense oligonucleotides. Among them, 
degradation of mRNA by RNase H is considered to be the major mechanism of inhibition of 
protein function. This technique was originally used to elucidate the function of a target 
gene, but may also have therapeutic applications, provided it is designed carefully and 
properly. 

An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40,
45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be constructed
using chemical synthesis and enzymatic ligation reactions using procedures known in the art.
For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be chemically
synthesized using naturally occurring nucleotides or variously modified nucleotides designed
to increase the biological stability of the molecules or to increase the physical stability of the
duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate
derivatives and acridine substituted nucleotides can be used. Examples of modified
nucleotides which can be used to generate the antisense nucleic acid include 5-fluorouracil,
5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine,
5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine,
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine,
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine,
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine,
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil,
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil,
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil,
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil,
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil,
3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine.

In addition, the use of morpholino oligonucleotides could be employed. Morpholinos
are oligomers with modification of the ribose moiety to a morpholino group. This technology
is covered by U.S. Patent No. 5,185,444 and is described in Summerton et al., 1997 Antisense
Nucleic Acid Drug Dev. 7(3):187-95. The antisense nucleic acid molecules of the invention
are typically administered to a subject or generated in situ such that they hybridize with or
bind to cellular mRNA and/or genomic DNA encoding an HBM or Zmax1 (LRP5) protein or
a protein which interacts with Zmax1 (LRP5) and/or HBM to thereby inhibit expression of
the protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by
conventional nucleotide complementarity to form a stable duplex, or, for example, in the case
of an antisense nucleic acid molecule which binds to DNA duplexes, through specific
interactions in the major groove of the double helix. An example of a route of administration
of an antisense nucleic acid molecule of the invention includes direct injection at a tissue site.
Alternatively, an antisense nucleic acid molecule can be modified to target selected cells and
then administered systemically. For example, for systemic administration, an antisense
molecule can be modified such that it specifically binds to a receptor or an antigen expressed
on a selected cell surface, e.g., by linking the antisense nucleic acid molecule to a peptide or
an antibody which binds to a cell surface receptor or antigen. The antisense nucleic acid
molecule can also be delivered to cells using the vectors described herein.

In yet another embodiment, the antisense nucleic acid molecule of the invention is an
α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific
double-stranded hybrids with complementary RNA in which, contrary to the usual β-units,
the strands run parallel to each other (Gaultier et al., 1987 Nucl. Acids Res. 15: 6625-41).
The antisense nucleic acid molecule can also comprise a 2'-O-methylribonucleotide (Inoue et
al., 1987 Nucl. Acids Res. 15: 6131-6148) or a chimeric RNA-DNA analogue (Inoue et al.,
1987 FEBS Lett. 215: 327-330). In still another embodiment, an antisense nucleic acid of the
invention is a ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity
which are capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which
they have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes
(described in Haselhoff et al., 1988 Nature 334: 585-591)) can be used to catalytically cleave
Zmax1 (LRP5) or HBM mRNA transcripts to thereby inhibit translation of Zmax1 (LRP5) or
HBM mRNA. A ribozyme having specificity for a Zmax1- or HBM-encoding nucleic acid
can be designed based upon the nucleotide sequence of a Zmax1 (LRP5) or HBM cDNA
disclosed herein (i.e., SEQ ID NOS: 1 or 3). For example, a derivative of a Tetrahymena L-19
IVS RNA can be constructed in which the nucleotide sequence of the active site is
complementary to the nucleotide sequence to be cleaved in an HBM- or Zmax1-encoding
mRNA. See, e.g., Cech et al. U.S. Patent No. 4,987,071 and Cech et al. U.S. Patent No.
5,116,742, both incorporated herein by reference. Alternatively, Zmax1 (LRP5) or HBM
mRNA can be used to select a catalytic RNA having a specific ribonuclease activity from a
pool of RNA molecules. See, e.g., Bartel et al., 1993 Science 261: 1411-1418. Alternatively,
Zmax1 (LRP5) or HBM gene expression can be inhibited by targeting nucleotide sequences
complementary to the regulatory region of the Zmax1 (LRP5) or HBM gene (e.g., the Zmax1
or HBM gene promoter and/or enhancers) to form triple helical structures that prevent
transcription of the Zmax1 (LRP5) or HBM genes in target cells. See generally, Helene, 1991
Anticancer Drug Des. 6(6):569-84; Helene et al., 1992 Ann. N.Y. Acad. Sci. 660:27-36; and
Maher, 1992 Bioassays 14(12):807-15.
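
For the hammerhead ribozyme approach described above, a first design step is usually to
locate candidate cleavage triplets in the target transcript. The following is a minimal
sketch, assuming the commonly cited NUH rule for hammerhead ribozymes (cleavage 3' of a
triplet in which N is any base and H is A, C or U). The function name and the toy transcript
are hypothetical and are not taken from SEQ ID NOS: 1 or 3.

```python
import re

# Assumed rule: hammerhead ribozymes cleave 3' of an NUH triplet
# (N = any base, H = A, C or U). The lookahead lets triplets overlap.
NUH_SITE = re.compile(r"(?=([ACGU]U[ACU]))")


def find_nuh_sites(mrna: str) -> list[tuple[int, str]]:
    """Return (0-based index, triplet) for every NUH triplet in an RNA string."""
    rna = mrna.upper().replace("T", "U")  # accept cDNA-style input as well
    return [(m.start(), m.group(1)) for m in NUH_SITE.finditer(rna)]


# Hypothetical toy transcript, for illustration only.
toy_transcript = "AAGUCAAGGUAC"
for index, triplet in find_nuh_sites(toy_transcript):
    print(f"candidate site {triplet} at position {index}")
```

Each hit would still have to be checked for accessibility and for uniqueness of the flanking
arms before a ribozyme is designed against it.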

Zmax1 (LRP5), LRP6, HBM-like and HBM gene expression can also be inhibited
using RNA interference (RNAi) caused by small inhibitory RNAs (siRNAs). This is a
technique for post-transcriptional gene silencing (PTGS), in which target gene activity is
specifically abolished with cognate double-stranded RNA (dsRNA). RNAi resembles PTGS in
plants in many aspects and has been detected in many invertebrates including
trypanosome, hydra, planaria, nematode and fruit fly (Drosophila melanogaster). It may be
involved in the modulation of transposable element mobilization and antiviral state
formation. RNAi in mammalian systems is disclosed in PCT application WO 00/63364
which is incorporated by reference herein in its entirety. See also, Elbashir et al., 2001
Nature 411: 494-98. Basically, dsRNA homologous to the target (Zmax1 or HBM) is
introduced into the cell and a sequence-specific reduction in gene activity is observed.

Another embodiment contemplates the use of small hairpin RNAs (shRNAs). These
compounds are described further in Yu et al., 2002 Proc. Natl. Acad. Sci. USA 99: 6047-52;
and Paddison et al., 2002 Genes & Devel. 16: 948-58.

As an example, preparing antisense oligonucleotides can be performed as follows.
Studies have been undertaken using antisense technology in the osteoblast-like murine cell
line, MC3T3. These cells can be triggered to develop along the bone differentiation
sequence. An initial proliferation period is characterized by minimal expression of
differentiation markers and initial synthesis of collagenous extracellular matrix. Collagen
matrix synthesis is required for subsequent induction of differentiation markers. Once the
matrix synthesis begins, osteoblast marker genes are activated in a clear temporal sequence:
alkaline phosphatase is induced at early times while bone sialoprotein and osteocalcin appear
later in the differentiation process. This temporal sequence of gene expression is useful in
monitoring the maturation and mineralization process. Matrix mineralization, which does not
begin until several days after maturation has started, involves deposition of mineral on and
within collagen fibrils deep within the matrix near the cell layer-culture plate interface. The
collagen fibril-associated mineral formed by cultured osteoblasts resembles that found in
woven bone in vivo and therefore is used frequently as a study reagent.
MC3T3 cells were transfected with antisense oligonucleotides for the first week of the
differentiation, according to the manufacturer's specifications (U.S. Patent No. 5,849,902).
The oligonucleotides designed for Zmax1 (LRP5) are given below:
10875: 5'-AGUACAGCUUCUUGCCAACCCAGUC-3' (SEQ ID NO:639)
10876: 5'-UCCUCCAGGUCGAUGGUCAGCCCAU-3' (SEQ ID NO:640)
10877: 5'-GUCUGAGUCCGAGUUCAAAUCCAGG-3' (SEQ ID NO:641)

Fig. 13 shows the results of antisense inhibition of Zmax1 (LRP5) in MC3T3 cells. The
three oligonucleotides shown above were transfected into MC3T3 cells and RNA was isolated
according to standard procedures. Northern analysis clearly shows markedly lower steady
state levels of the Zmax1 (LRP5) transcript while the control gene GAPDH remained
unchanged. Thus, antisense technology using the oligonucleotides described above allows for
the study of the role of Zmax1 (LRP5) expression on bone biology.
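
As an illustration of how the antisense oligonucleotides relate to their target, the sense
sequence each one is expected to pair with is simply its reverse complement. The sketch
below takes the three oligonucleotide sequences from the listing above; the function names
are hypothetical, and the GC fraction is included only as a common descriptive statistic,
not as a criterion stated in this specification.

```python
# Watson-Crick partners for RNA bases.
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}

# Antisense oligonucleotides as listed above (SEQ ID NOS: 639-641), written 5' to 3'.
ANTISENSE_OLIGOS = {
    "10875": "AGUACAGCUUCUUGCCAACCCAGUC",
    "10876": "UCCUCCAGGUCGAUGGUCAGCCCAU",
    "10877": "GUCUGAGUCCGAGUUCAAAUCCAGG",
}


def reverse_complement_rna(oligo: str) -> str:
    """Return the 5'-to-3' RNA sequence that base-pairs with the given oligo."""
    return "".join(RNA_COMPLEMENT[base] for base in reversed(oligo.upper()))


def gc_fraction(oligo: str) -> float:
    """Fraction of G and C bases in the oligo."""
    seq = oligo.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


for name, oligo in ANTISENSE_OLIGOS.items():
    target = reverse_complement_rna(oligo)
    print(f"{name}: target 5'-{target}-3', GC = {gc_fraction(oligo):.2f}")
```

The printed target strings can then be searched for in the Zmax1 (LRP5) transcript to
confirm where each oligonucleotide is expected to hybridize.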

16. Yeast Two Hybrid 

In order to identify the signaling pathway that Zmax1 (LRP5) participates in to
modulate bone density, the yeast two hybrid protein interaction technology was utilized.
This technique facilitates the identification of proteins that interact with one another by
coupling tester proteins to components of a yeast transcription system (Fields et al., 1989
Nature 340: 245-246; U.S. Pat. No. 5,283,173 by Fields et al.; Johnston, 1987 Microbiol.
Rev. 51: 458-476; Keegan et al., 1986 Science 231: 699-704; Durfee et al., 1993 Genes Dev.
7: 555-569; Chien et al., 1991 Proc. Natl. Acad. Sci. USA 88: 9578-9582; Fields et al., 1994
Trends in Genetics 10: 286-292; and Gyuris et al., 1993 Cell 75: 791-803). First, a "bait"
protein, the protein for which one seeks interacting proteins, is fused to the DNA binding
domain of a yeast transcription factor. Second, a cDNA library is constructed that contains
cDNAs fused to the transcriptional activation domain of the same yeast transcription factor;
this is termed the prey library. The bait construct and prey library are transformed into yeast
cells and then mated to produce diploid cells. If the bait interacts with a specific prey from
the cDNA library, the activation domain is brought into the vicinity of the promoter via this
interaction. Transcription is then driven through selectable marker genes and growth on
selective media indicates the presence of interacting proteins.

The amino acid sequence used in the yeast two hybrid experiments discussed herein 
consisted of the entire cytoplasmic domain and a portion of the transmembrane domain and is 
shown below (amino to carboxy orientation): 

RWCQRYAGA NGPFPHEYVS GTPHVPLNFI APGGSQHGPF TGIACGKSMM
SSVSIMGGRG GVPLYDRNHV TGASSSSSSS TKATLYPPIL MPPPSPATDP
SLYNMDMFYS SNIPATVRPY PPYTTRHMAP PTTPCSTDVC DSDYSASRWK
ASKYYLDLNS DSDPYPPPPT EHSQYLSAED SCPPSPATER SYFHLFPPPP
SPGTDSS (SEQ ID NO: 765)

The last 6 amino acids of the putative transmembrane domain are indicated in bold.
Putative SH3 domains are underlined. Additional amino acid sequences of 50 amino acids or
greater in the proteins encoded by either the Zmax1 (LRP5) or HBM alleles can also be used
as bait. The upper size of the polypeptide used as bait is limited only by the presence of a
complete transmembrane domain (see Fig. 4), which would render the bait nonfunctional
in a yeast two hybrid system. These additional bait proteins can be used to identify additional
proteins which interact with the proteins encoded by HBM or Zmax1 (LRP5) in the focal
adhesion signaling pathway or in other pathways in which these HBM or Zmax1 (LRP5)
proteins may act. Once identified, methods of identifying agents which regulate the proteins
in the focal adhesion signaling pathway or other pathways in which HBM acts can be
performed as described herein for the HBM and Zmax1 (LRP5) proteins.
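
Because the bold and underline markings of the bait sequence do not survive in plain text,
proline-rich stretches can be located computationally instead. The following is a minimal
sketch, assuming that the minimal PxxP consensus commonly associated with SH3 binding is a
reasonable proxy for the marked regions; that assumption, the function name, and the choice
of example fragment are illustrative only. The fragment is one ten-residue block copied from
the bait sequence above.

```python
import re

# Assumed motif: minimal PxxP consensus (proline, any two residues, proline).
# The lookahead reports overlapping matches.
PXXP = re.compile(r"(?=(P..P))")


def find_pxxp(protein: str) -> list[tuple[int, str]]:
    """Return (0-based index, motif) for every PxxP occurrence in a protein string."""
    seq = "".join(protein.upper().split())  # tolerate spaced 10-residue blocks
    return [(m.start(), m.group(1)) for m in PXXP.finditer(seq)]


# One block of the bait sequence shown above.
fragment = "MPPPSPATDP"
print(find_pxxp(fragment))
```

Running the same function over the full bait sequence lists every candidate proline-rich
motif in the cytoplasmic domain.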

In order to identify cytoplasmic Zmax1 (LRP5) signaling pathways, the cytoplasmic
domain of Zmax1 (LRP5) was subcloned into two bait vectors. The first vector was pDBleu,
which was used to screen brain and HeLa prey cDNA libraries cloned into the vector pPC86
(Clontech). The second bait vector used was pDBtrp, which was used to screen a cDNA prey
library derived from the TE85 osteosarcoma cell line in vector pOP46. Another suitable
vector, which is widely available, is p86 (Gibco, ProQuest™ System). Standard techniques
known to those skilled in the art were used as described in Fields and Song, 1989 Nature 340:
245-246; U.S. Pat. No. 5,283,173 by Fields and Song; Johnston, 1987 Microbiol. Rev. 51:
458-476; Keegan et al., 1986 Science 231: 699-704; Durfee et al., 1993 Genes Dev. 7: 555-569;
Chien et al., 1991 Proc. Natl. Acad. Sci. USA 88: 9578-9582; Fields et al., 1994 Trends in
Genetics 10: 286-292; and Gyuris et al., 1993 Cell 75: 791-803. The bait construct and prey
cDNA libraries were transformed into yeast using standard procedures.

To perform the protein interaction screen, an overnight culture of the bait yeast strain
was grown in 20 ml SD selective medium with 2% glucose (pDBleu: SD -Leu medium;
pDBtrp: SD -Trp medium). The cultures were shaken vigorously at 30°C overnight. The
cultures were diluted 1:10 with complete medium (YEPD with 2% glucose) and then
incubated with shaking for 2 hrs at 30°C.

The frozen prey library was thawed, and the yeast cells reactivated by growing them 
in 150 ml YEPD medium with 2% glucose for 2 hrs at 30°C. A filter unit was sterilized with 
70% ethanol and washed with sterile water to remove the ethanol. The cell densities of both 
bait and prey cultures were measured by determining the OD at 600 nm. An appropriate 
volume of yeast cells that corresponded to a cell number of 1 ml of OD 600 = 4 of each yeast 
strain, bait and prey (library) was placed in a 50 ml Falcon tube. The mixture was then 
filtered through the sterilized filter unit. The filter was then transferred onto a prewarmed 
YEPD agar plate with the cell side up, removing all air bubbles underneath the filter. Plates 
were then incubated at 30 °C for 6 hrs. One filter was transferred into a 50 ml Falcon tube, 
and 10 ml of SD with 2% Glucose was added; cells were resuspended by vortexing for 10 
sec. 
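
The volume of each culture taken for a filter mating follows directly from its measured
optical density: the amount of cells contained in 1 ml at OD600 = 4 is 4 OD units, so the
volume needed is 4 divided by the measured OD600. A minimal sketch, with a hypothetical
function name and example readings:

```python
def volume_for_od_units(measured_od600: float, target_od_units: float = 4.0) -> float:
    """Volume (ml) of culture containing the cells of 1 ml at OD600 = target_od_units."""
    if measured_od600 <= 0:
        raise ValueError("measured OD600 must be positive")
    return target_od_units / measured_od600


# Hypothetical readings for the bait and prey cultures.
for strain, od in {"bait": 2.0, "prey": 0.8}.items():
    print(f"{strain}: take {volume_for_od_units(od):.2f} ml")
```

The calculation assumes that OD600 scales linearly with cell number, which in practice
holds only for readings taken within the linear range of the spectrophotometer.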

The number of primary diploid cells (growth on SD -Leu, -Trp plates) and the numbers of
colony forming units growing on SD -Trp only and SD -Leu only plates were then
titered. Different dilutions were plated and incubated at 30°C for two days. The number of
colony forming units was then counted. The number of diploid colonies (colonies on
SD -Leu, -Trp plates) permits the calculation of whether or not the whole library of prey
constructs was mated to the yeast expressing the bait. This information is important to judge
the quality of the screen.
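
The titering described above can be turned into numbers as follows. This is a minimal
sketch under stated assumptions: mating efficiency is taken as diploid colony forming units
divided by those of the limiting parent, and library coverage as total diploids divided by
the number of independent clones in the prey library. The function names, the library
complexity value and all example counts are hypothetical.

```python
def cfu_per_ml(colonies: int, fold_dilution: float, plated_ml: float) -> float:
    """Colony forming units per ml of the undiluted suspension."""
    return colonies * fold_dilution / plated_ml


def mating_summary(diploid_cfu_ml: float, bait_cfu_ml: float, prey_cfu_ml: float,
                   resuspension_ml: float, library_complexity: float) -> dict:
    """Mating efficiency and library coverage from titers (assumed definitions)."""
    limiting_parent = min(bait_cfu_ml, prey_cfu_ml)
    total_diploids = diploid_cfu_ml * resuspension_ml
    return {
        "mating_efficiency": diploid_cfu_ml / limiting_parent,
        "total_diploids": total_diploids,
        "library_coverage": total_diploids / library_complexity,
    }


# Hypothetical example: 10 ml resuspension, library of 5 x 10^5 independent clones.
summary = mating_summary(
    diploid_cfu_ml=cfu_per_ml(100, 100, 0.1),      # SD -Leu, -Trp plate
    bait_cfu_ml=cfu_per_ml(100, 10_000, 0.1),      # SD -Leu plate
    prey_cfu_ml=cfu_per_ml(200, 1_000, 0.1),       # SD -Trp plate
    resuspension_ml=10.0,
    library_complexity=5e5,
)
print(summary)
```

A coverage value above 1 suggests that, on average, each prey clone had the opportunity to
mate with the bait strain at least once.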



104 



WO 02/092000 



PCT/US02/14877 



16.1 Indirect selection 

Resuspended cells from 5 filter-matings were then pooled and the cells sedimented by
centrifugation in a 50 ml Falcon tube. Cells were then resuspended in 16 ml SD medium
with 2% Glc. Two ml of this cell suspension was plated onto each of 8 square plates
(SD -Leu, -Trp) with sterile glass beads and selected for diploid cells by incubating at
30°C for 18 - 20 hrs.

Cells were then scraped off the square plates, centrifuged and combined into
one 50 ml Falcon tube. The cell pellet was then resuspended in 25 ml of SD medium with
2% glucose. The cell number was then determined by counting an appropriate dilution
(usually 1:100 to 1:1000) with a Neubauer chamber. Approximately 5 x 10^7 diploid cells
were plated onto the selective medium. Observations of the growth of the bait strain
together with irrelevant prey vectors help to determine which selective plates should be
chosen for the library screen. Generally, all screens were plated on one square plate each
of SD -Leu, -Trp, -His; SD -Leu, -Trp, -His, 5 mM 3AT; and SD -Leu, -Trp, -His, -Ade.
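
The counting step above determines how much of the suspension to plate. A minimal sketch,
assuming a standard counting chamber in which one large square (1 mm x 1 mm x 0.1 mm deep)
holds 0.1 µl, so that the mean count per large square times the dilution times 10^4 gives
cells per ml. The function names and example counts are hypothetical.

```python
# Assumed chamber geometry: one large square holds 0.1 µl, i.e. 1e-4 ml.
LARGE_SQUARE_VOLUME_ML = 1e-4


def cells_per_ml(mean_count_per_square: float, fold_dilution: float) -> float:
    """Cell density of the undiluted suspension from a counting-chamber reading."""
    return mean_count_per_square * fold_dilution / LARGE_SQUARE_VOLUME_ML


def volume_to_plate_ml(target_cells: float, density_cells_per_ml: float) -> float:
    """Volume of suspension that contains the target number of cells."""
    return target_cells / density_cells_per_ml


# Hypothetical reading: 50 cells per large square at a 1:100 dilution.
density = cells_per_ml(50, 100)
print(f"density: {density:.2e} cells/ml")
print(f"plate {volume_to_plate_ml(5e7, density):.2f} ml for 5 x 10^7 diploid cells")
```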

The yeast cells were then spread homogeneously with sterile glass beads and
incubated at 30°C for 4 days. The number of colony forming yeast cells was titered by
plating different dilutions of the scraped cell suspension onto SD -Leu, -Trp plates. Usually,
plating of 100 µl of a 10^-3 and 10^-4 dilution gave 100 - 1000 colonies per plate.

16.2 Direct selection 

Five filters with the mated yeast cells were each transferred into separate 50 ml 
Falcon tubes and the cells resuspended with 10 ml SD medium with 2% Glc, each, followed 
by vortexing for 10 sec. The resuspended cells were combined and centrifuged in a Beckman 
centrifuge at 3000 rpm. The supernatant was discarded and the cells resuspended in 6 ml of 
SD medium with 2% Glc. Two ml of the suspension was spread onto each selective square 
plate and incubated at 30 °C for 4 - 5 days.

16.3 Isolation of Single Colonies

Yeast cells from an isolated colony were picked with a sterile toothpick and
transferred into individual wells of a 96 well plate. The cells were resuspended in 50 µl of SD
-Leu, -Trp, -His medium and incubated at 30°C for one day. The yeast cells were then
stamped onto a SD -Leu, -Trp, -His plate in 96 well format and incubated at 30°C for 2 days.
Yeast cells were also stamped onto a Nylon filter covering a YEPD plate and incubated at
30°C for one day. The cells on the Nylon filter were used for the analysis of the β-Gal
reporter activity.

Yeast colonies were scraped from the SD -Leu, -Trp, -His plate with a sterile
toothpick, and reconfigured, if necessary, according to the β-Gal activity and then
resuspended in 20% glycerol. This served as a master plate for storage at -80 °C.

For DNA preparation, yeast cells from the glycerol stock were stamped onto a SD -Trp
plate and incubated at 30°C for 2 days. After two days of incubation, the yeast colonies
were ready for colony PCR and sequencing. Standard colony PCR conditions were used to
amplify inserts from preys recovered from the interaction screen. Sequencing was done
using standard sequencing reactions and ABI377 (Perkin Elmer) fluorescent sequencing
machines.

16.4 Verification of bait/prey interaction 

Glycerol stocks of the prey of interest were thawed and inoculated into a 10 ml
overnight culture of SD -Trp with glucose. After overnight growth, plasmid DNA
preparation was performed using the BIO 101 RPM Yeast Plasmid Isolation Kit with 10 ml
of culture. The culture was centrifuged and the cell pellet transferred to a 1.5 ml
microcentrifuge tube. Yeast Lysis Matrix was then added to the pellet followed by 250 µl of
Alkaline Lysis Solution. Samples were then vortexed for 5 minutes. 250 µl of Neutralizing
Solution was added and the sample mixed briefly. Samples were centrifuged for 2 minutes at
room temperature in a microcentrifuge. The supernatant was transferred to a Spin Filter
avoiding debris and Lysis Matrix. 250 µl of Glassmilk Spin Buffer was added, and the tubes
inverted to mix. Samples were centrifuged for 1 min and the liquid in the Catch Tube was
discarded. 500 µl of Wash Solution was added, the samples were centrifuged for 1 min, and the
wash solution was discarded. The wash step was repeated once followed by a 1 min dry
centrifugation to drive the remaining liquid out of the Spin Filter. The filter was transferred
to a new Catch Tube and 100 µl of sterile H2O was added; samples were then vortexed briefly
to resuspend and centrifuged for 30 seconds to collect the DNA in the bottom of the Catch
Tube.

Five µl of DNA was then transformed into DH10B Electromax cells using standard
procedures and glycerol stocks prepared. Miniprep DNA was prepared using the Qiagen
QIAprep Spin Miniprep Kit. DNA was finally eluted with 30 µl of Qiagen EB buffer. One
µl of the plasmid DNA samples was then used to transform yeast cells using standard
procedures. After 2 days of growth on SD -Trp media, colonies were picked and patched onto
fresh media. Similarly, bait colonies were patched onto SD -Leu media. Both were grown
overnight at 30°C.

For mating, cells from bait and prey patches were spread together on YAPD media
and incubated at 30°C for 12 hr. This plate was then replica plated onto an SD Agar-Leu-Trp
plate and grown for 2 days at 30°C. To test the strength of interaction these plates were
replica plated onto SD Agar-Leu-Trp-His, SD Agar-Leu-Trp-His with 5 mM 3AT and 10
mM 3AT, SD Agar-Leu-Trp-His-Ade, and SD Agar-Leu-Trp-Ura media and grown for 2
days at 30°C.

16.5 Galacton Star β-Galactosidase Activity Assay

After streaking and replica plating positive interactors on selection plates, colonies
were placed in a 96 well dish with 200 µl of SD-medium, leaving wells 1 and 96 blank. Ten
microliters from the first 96 well dish was plated into another flat bottom 96 well dish
containing 100 µl of SD-medium. Controls consisted of a negative control and a very weak
positive control. The cell density was measured at OD600 (a value of 1 corresponds to 1 x 10^7
cells utilizing a 96 well spectrophotometer). The OD was usually between 0.03 and 0.10.
Using microplates specifically for the luminometer, 50 µl of reaction mixture were pipetted
into each well. Fifty microliters of culture were then added and mixed by pipetting up and
down twice. The reaction was incubated for 30 minutes at room temperature followed by
measurement of Relative Light Units using a luminometer.
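
To compare reporter activity between wells, the luminometer reading is typically
normalized to the number of cells assayed. The sketch below assumes that the stated
conversion (OD600 of 1 corresponds to 1 x 10^7 cells) is per ml of culture, that 50 µl of
culture is assayed per well, and that a blank well is subtracted; these assumptions, the
function names and the example readings are illustrative and are not stated in the protocol.

```python
CELLS_PER_ML_PER_OD = 1e7   # stated conversion, assumed here to be per ml
ASSAY_VOLUME_ML = 0.05      # 50 µl of culture per well


def cells_in_well(od600: float, volume_ml: float = ASSAY_VOLUME_ML) -> float:
    """Estimated number of cells added to one luminometer well."""
    return od600 * CELLS_PER_ML_PER_OD * volume_ml


def rlu_per_cell(rlu: float, blank_rlu: float, od600: float) -> float:
    """Blank-subtracted Relative Light Units divided by the estimated cell number."""
    cells = cells_in_well(od600)
    if cells <= 0:
        raise ValueError("OD600 must be positive to normalize")
    return (rlu - blank_rlu) / cells


# Hypothetical readings: sample well versus blank well.
print(rlu_per_cell(rlu=5250.0, blank_rlu=250.0, od600=0.05))
```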

Table 9 lists the genes identified in the yeast two hybrid screens from the 3 prey
libraries tested. Two genes, zyxin and axin, were found to interact with the cytoplasmic
domain of Zmax1 (LRP5) in all three screens. Three genes, alpha-actinin, TCB and S1-5,
interacted in two of the three screens.

A variety of proteins found at sites of cell-cell and cell-matrix contact (focal
contacts/adhesion plaques) were shown to interact with the cytoplasmic domain of Zmax1
(LRP5). These include α-actinin, Trio, Pinch-like protein, and Zyxin. PINCH is a LIM
domain-containing protein that is known to interact with integrin-linked kinase, an early
signaler in integrin and growth factor signaling pathways. The finding of a closely related
gene in the yeast two hybrid screen raises the possibility of a novel pathway linked to integrin
signaling from extracellular matrix signals. Trio, also known to localize to focal adhesions, is
thought to play a key role in coordinating cell-matrix interactions and cytoskeletal
rearrangements involved in cell movement. Zyxin, another LIM domain-containing protein,
is also localized to adhesion plaques and is thought to be involved in reorganization of the
cytoskeleton when triggers are transmitted via integrin signaling pathways. Zyxin also
interacts with alpha-actinin, which we identified as interacting with Zmax1 (LRP5). Other
LIM domain-containing proteins identified include the human homologue of mouse ajuba,
LIMD1, and a novel LIMD1-like protein.

Axin was also identified from the two hybrid experiments. This protein is involved in
inhibition of the Wnt signaling pathway and interacts with the tumor suppressor APC. There
is a link here with the focal adhesion signaling described above: one common step in the two
pathways involves inhibition of glycogen synthase kinase 3, which in turn results in the
activation of β-catenin/Lef-1 and AP-1 transcription factors. Axin/APC, as well as
integrin-linked kinase, are involved in this step. The Wnt pathway has a role in determining
cell fates during embryogenesis. If inappropriately activated, the Wnt pathway may also lead
to cancer. The Wnt pathway also seems to have a role in cytoskeletal rearrangements. In a
Xenopus embryo assay, the combination of HBM and Wnt5a proteins stimulated the Wnt pathway
to a much greater extent than the combination of Zmax1 (LRP5) and Wnt5a, which was modestly
above the control and Wnt5a alone scores. The HBM and Zmax1 (LRP5) extracellular domains
(ECD) caused a modest stimulation of Wnt signaling in the absence of Wnt5a; with the HBM
ECD, this stimulation was slightly increased by the presence of Wnt5a. A model of Zmax1
(LRP5) involvement in focal adhesion signaling is depicted in Fig. 15.

These data, coupled with other studies, suggest that integrin signaling pathways have a
role in cellular responses to mechanical stress and adhesion. This provides an attractive
model for the mechanism of action of Zmax1 (LRP5) in bone biology. It is possible that
Zmax1 (LRP5) is involved either in sensing mechanical stress directly or in binding a molecule
in the extracellular matrix that is related to mechanical sensation. Signaling through
subsequent pathways may be involved in bone remodeling due to effects on cell morphology,
cell adhesion, migration, proliferation, differentiation, and apoptosis in bone cells.



TABLE 10: Yeast Two Hybrid Results

Gene         Gene                                                Genbank       NT SEQ  AA SEQ
Symbol                                                           Accession #   ID NO:  ID NO:

ACTN1        alpha-actinin                                       NM_001102     642
AES          amino-terminal enhancer of                          NM_001130.3   643
AIP4         atrophin-1 interacting protein                      AF038564.1    644
Novel        Ajuba                                                             645
AXIN         Wnt signaling                                       AF009674.1    646
CDC23        cell division cycle 23, yeast, homolog              NM_004661.1   647
HSM800944    Similar to TRIO                                     AL117435.1    648
HSM800936                                                        AL117427.1    649
Novel        Similar to LIM domains containing protein 1                       650
DEEPEST      mitotic spindle coiled-coil related protein         NM_006461.1   651
ECM1         extracellular matrix protein 1                      U65932.1      652
EF1A         elongation factor 1-alpha                           X16869.1      653
FN           fibronectin                                         X02761.1      654
HOXB13       homeodomain protein                                 U81599.1      655
Novel        Glu-Lys Rich protein                                              656
LIMD1        LIM domains containing 1                            NM_014240.1   657
Novel        PINCH-like                                                        658
RANBPM       centrosomal protein                                 NM_005493.1   659
S1-5         extracellular protein                               U03877.1      660
TCB          gene encoding cytosolic thyroid hormone-binding     M26252.1      661
TID          tumorous imaginal discs                             NM_005147.1   662
ZYX          Zyxin                                               NM_003461.1   663
TRIO         GTPase                                              U42390.1      664
HUMPITPB     phosphatidylinositol transfer protein               D30037.1      665
ACTN1        alpha-actinin                                       NP_001093.1           666
AES          amino-terminal enhancer of                          NP_001121.2           667
AIP4         atrophin-1 interacting protein                      AAC04845.1            668
Novel        Ajuba                                                                     669
AXIN         Wnt signaling                                       AAC51624.1            670
CDC23        cell division cycle 23, yeast homolog               NP_004652.1           671
Novel        Similar to TRIO                                     CAB55923.1            672
Novel        Similar to LIM domains containing protein 1                               673
DEEPEST      mitotic spindle coiled-coil related protein         NP_006452.1           674
ECM1         extracellular matrix protein 1                      AAB05933.1            675
EF1A         elongation factor 1-alpha                           CAA34756.1            676
FN           fibronectin                                         CAA26536.1            677
Novel        Glu-Lys rich protein                                                      678
HOXB13       homeodomain protein B13                             AAB39863.1            679
LIMD1        LIM domains containing 1                            NP_055055.1           680
Novel        PINCH-like                                                                681
RANBPM       centrosomal protein                                 NP_005484.1           682
S1-5         extracellular protein                               AAA65590.1            683
TCB          cytosolic thyroid hormone-binding protein           AAA36672.1            684
TID          tumorous imaginal discs                             NP_005138.1           685
ZYX          Zyxin                                               NP_003452.1           686
TRIO         GTPase                                              AAC34245.1            687
PTDINSTP     phosphatidylinositol transfer protein beta isoform  P48739                688



In light of the model depicted in Fig. 15 and the results shown in Table 10, another
aspect contemplated by the invention would be to regulate bone density and bone mass
disorders by regulating focal adhesion signaling. The regulation can occur by regulating
the DNA, mRNA transcript or protein encoded by any of the members involved in the focal
adhesion signaling pathway as identified by the yeast two hybrid system.

Also contemplated are the novel nucleic acids and proteins identified by the HBM
yeast two hybrid system. These include but are not limited to SEQ ID NO: 645 (Ajuba), SEQ
ID NO: 651 (a gene similar to a gene encoding LIM domains containing protein 1), SEQ ID
NO: 656 (Glu-Lys Rich protein), SEQ ID NO: 658 (PINCH-like gene), SEQ ID NO: 669
(Ajuba protein), SEQ ID NO: 672 (protein similar to TRIO), SEQ ID NO: 673, SEQ ID NO:
678 (Glu-Lys rich protein) and SEQ ID NO: 681 (PINCH-like protein).

17. Potential Function 

The proteins encoded by Zmax1 (LRP5) and LRP6 are related to the Low Density
Lipoprotein receptor (LDL receptor). See, Goldstein et al., 1985 Ann. Rev. Cell Biology, 1:
1-39; Brown et al., 1986 Science, 232:34-41. The LDL receptor is responsible for uptake of
low density lipoprotein, a lipid-protein aggregate that includes cholesterol. Individuals with a
defect in the LDL receptor are deficient in cholesterol removal and tend to develop
atherosclerosis. In addition, cells with a defective LDL receptor show increased production
of cholesterol, in part because of altered feedback regulation of cholesterol synthetic enzymes
and in part because of increased transcription of the genes for these enzymes. In some cell
types, cholesterol is a precursor for the formation of steroid hormones.

Thus, the LDL receptor may, directly or indirectly, function as a signal transduction
protein and may regulate gene expression. Because Zmax1 (LRP5) and LRP6 are related to
the LDL receptor, these proteins may also be involved in signaling between cells in a way
that affects bone remodeling.

The glycine 171 amino acid is likely to be important for the function of Zmax1
(LRP5) because this amino acid is also found in the mouse homologue of Zmax1 (LRP5).
The closely related LRP6 (Genbank Accession No. JE0272) protein also contains glycine at
the corresponding position (Brown et al., 1998 Biochem. Biophys. Res. Comm. 248: 879-88).
Amino acids that are important in a protein's structure or function tend to be conserved
between species, because natural selection prevents mutations that alter amino acids at
important positions from persisting.

In addition, the extracellular domain of Zmax1 (LRP5) contains four repeats
consisting of five YWTD motifs followed by an EGF motif. This 5YWTD+EGF repeat is
likely to form a distinct folded protein domain, as this repeat is also found in the LDL
receptor and other LDL receptor-related proteins. The first three 5YWTD+EGF repeats are
very similar in their structure, while the fourth is highly divergent. Glycine 171 occurs in the
central YWTD motif of the first 5YWTD+EGF repeat in Zmax1 (LRP5). The other two
similar 5YWTD+EGF repeats of Zmax1 (LRP5) also contain glycine at the corresponding
position, as does the 5YWTD+EGF repeat in the LDL receptor protein. However, only
17.6% of the amino acids are identical among the first three 5YWTD+EGF repeats in Zmax1
(LRP5) and the single repeat in the LDL receptor. These observations indicate that glycine
171 is essential to the function of this repeat, and mutation of glycine 171 causes a functional
alteration of Zmax1 (LRP5). The cDNA and peptide sequences are shown in Figs. 6A-6J.
The critical base at nucleotide position 582 is indicated in bold and is underlined.
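
A figure such as the 17.6% identity quoted above is obtained by aligning the repeats and
counting the alignment columns at which every sequence carries the same residue. The sketch
below shows that calculation for sequences that have already been aligned to equal length.
The function name and the short toy sequences are hypothetical; they are not the actual
5YWTD+EGF repeats and will not reproduce the 17.6% value.

```python
def percent_identity(aligned: list[str]) -> float:
    """Percentage of alignment columns in which all sequences share one residue.

    Sequences must be pre-aligned to the same length; columns containing a
    gap character ('-') are never counted as identical.
    """
    if not aligned:
        raise ValueError("at least one sequence is required")
    length = len(aligned[0])
    if length == 0 or any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be non-empty and of equal length")
    identical = sum(
        1 for column in zip(*aligned)
        if "-" not in column and len(set(column)) == 1
    )
    return 100.0 * identical / length


# Hypothetical toy alignment, for illustration only.
toy_alignment = ["YWTDG", "YWSDG", "YWTDG", "FWTDG"]
print(f"{percent_identity(toy_alignment):.1f}% identical columns")
```

A residue that stays invariant in a set of repeats with such low overall identity, as
glycine 171 does, is the kind of position this column-wise count is meant to highlight.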

Northern blot analysis (Figs. 7A-B) reveals that Zmax1 (LRP5) is expressed in human
bone tissue as well as numerous other tissues. A multiple-tissue Northern blot (Clontech,
Palo Alto, CA) was probed with exons from Zmax1 (LRP5). As shown in Fig. 7A, the 5.5 kb
Zmax1 (LRP5) transcript was highly expressed in heart, kidney, lung, liver and pancreas and
was expressed at lower levels in skeletal muscle and brain. A second Northern blot, shown in
Fig. 7B, confirmed the transcript size at 5.5 kb, and indicated that Zmax1 (LRP5) is
expressed in bone, bone marrow, calvaria and human osteoblastic cell lines.

Taken together, these results coupled with the yeast two hybrid results indicate that
the HBM polymorphism in the Zmax1 (LRP5) gene is responsible for the HBM phenotype,
and that the Zmax1 gene is important in bone development. In addition, because mutation of
Zmax1 can alter bone mineralization and development, it is likely that molecules that bind to
Zmax1 may usefully alter bone development. Such molecules may include, for example,
small molecules, proteins, RNA aptamers, peptide aptamers, and the like.

18. Preparation of Nucleic Acids, Vectors, Transformations and Host Cells

Large amounts of the nucleic acids of the present invention may be produced by
replication in a suitable host cell. Natural or synthetic nucleic acid fragments coding for a
desired fragment will be incorporated into recombinant nucleic acid constructs, usually DNA
constructs, capable of introduction into and replication in a prokaryotic or eukaryotic cell.
Usually the nucleic acid constructs will be suitable for replication in a unicellular host, such
as yeast or bacteria, but may also be intended for introduction to (with and without
integration within the genome) cultured mammalian or plant or other eukaryotic cell lines.
The purification of nucleic acids produced by the methods of the present invention is
described, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd
Ed. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989) or Ausubel et al.,
Current Protocols in Molecular Biology, J. Wiley and Sons, NY (1992).

The nucleic acids of the present invention may also be produced by chemical 
synthesis, e.g., by the phosphoramidite method described by Beaucage et al., 1981 Tetra. 
Lett. 22: 1859-62 or the triester method according to Matteucci et al., 1981 J. Am. Chem. Soc. 
103: 3185, and may be performed on commercial, automated oligonucleotide synthesizers. A 
double-stranded fragment may be obtained from the single-stranded product of chemical 
synthesis either by synthesizing the complementary strand and annealing the strands together 
under appropriate conditions or by adding the complementary strand using DNA polymerase 
with an appropriate primer sequence. 

Nucleic acid constructs prepared for introduction into a prokaryotic or eukaryotic host 
may comprise a replication system recognized by the host, including the intended nucleic 
acid fragment encoding the desired protein, and will preferably also include transcription and 
translational initiation regulatory sequences operably linked to the protein encoding segment. 
Expression vectors may include, for example, an origin of replication or autonomously 
replicating sequence (ARS) and expression control sequences, a promoter, an enhancer and 
necessary processing information sites, such as ribosome-binding sites, RNA splice sites, 
polyadenylation sites, transcriptional terminator sequences, and mRNA stabilizing sequences. 
Secretion signals may also be included where appropriate, whether from a native HBM or 
Zmax1 (LRP5) protein or from other receptors or from secreted proteins of the same or 
related species, which allow the protein to cross and/or lodge in cell membranes, and thus 
attain its functional topology, or be secreted from the cell. Such vectors may be prepared by 
means of standard recombinant techniques well known in the art and discussed, for example, 
in Sambrook et al. (1989) or Ausubel et al. (1992). 

An appropriate promoter and other necessary vector sequences will be selected so as 
to be functional in the host, and may include, when appropriate, those naturally associated 
with Zmax1 (LRP5) or HBM genes. Examples of workable combinations of cell lines and 
expression vectors are described in Sambrook et al. (1989) or Ausubel et al. (1992). Many 
useful vectors are known in the art and may be obtained from such vendors as Stratagene, 
New England BioLabs, Promega Biotech, and others. Promoters such as the trp, lac and 
phage promoters, tRNA promoters and glycolytic enzyme promoters may be used in 
prokaryotic hosts. Useful yeast promoters include promoter regions for metallothionein, 
3-phosphoglycerate kinase or other glycolytic enzymes such as enolase or 
glyceraldehyde-3-phosphate dehydrogenase, enzymes responsible for maltose and galactose 
utilization, and others. Vectors and promoters suitable for use in yeast expression are further 
described in EP 73,675 A. Appropriate non-native mammalian promoters might include the 
early and late promoters from SV40 (Fiers et al., 1978 Nature 273: 113) or promoters derived 
from murine Moloney leukemia virus, mouse tumor virus, avian sarcoma viruses, adenovirus II, 
bovine papilloma virus or polyoma. In addition, the construct may be joined to an amplifiable 
gene (e.g., DHFR) so that multiple copies of the gene may be made. For appropriate enhancer 
and other expression control sequences, see also Enhancers and Eukaryotic Gene Expression, 
Cold Spring Harbor Press, Cold Spring Harbor, NY (1983). 

While such expression vectors may replicate autonomously, they may also replicate 
by being inserted into the genome of the host cell, by methods well known in the art. 

Expression and cloning vectors will likely contain a selectable marker, a gene 
encoding a protein necessary for survival or growth of a host cell transformed with the vector. 
The presence of this gene ensures growth of only those host cells which express the inserts. 
Typical selection genes encode proteins that a) confer resistance to antibiotics or other toxic 
substances, e.g., ampicillin, neomycin, methotrexate, etc.; b) complement auxotrophic 
deficiencies; or c) supply critical nutrients not available from complex media, e.g., the gene 
encoding D-alanine racemase for Bacilli. The choice of the proper selectable marker will 
depend on the host cell, and appropriate markers for different hosts are well known in the art. 

The vectors containing the nucleic acids of interest can be transcribed in vitro, and the 
resulting RNA introduced into the host cell by well-known methods, e.g., by injection (see 
Kubo et al., FEBS Letts. 241: 119 (1988)), or the vectors can be introduced directly into host 
cells by methods well known in the art, which vary depending on the type of cellular host, 
including electroporation; transfection employing calcium chloride, rubidium chloride, 
calcium phosphate, DEAE-dextran, or other substances; microprojectile bombardment; 
lipofection; infection (where the vector is an infectious agent, such as a retroviral genome); 
and other methods. See generally Sambrook et al., 1989 and Ausubel et al., 1992. The 
introduction of the nucleic acids into the host cell by any method known in the art, including 
those described above, will be referred to herein as "transformation." The cells into which 
the nucleic acids described above have been introduced are meant to also include the progeny 
of such cells. 

Large quantities of the nucleic acids and proteins of the present invention may be 
prepared by expressing the Zmax1 (LRP5), LRP6, HBM or HBM-like nucleic acids or 
portions thereof in vectors or other expression vehicles in compatible prokaryotic or 
eukaryotic host cells. The most commonly used prokaryotic hosts are strains of Escherichia 
coli, although other prokaryotes, such as Bacillus subtilis or Pseudomonas may also be used. 

Mammalian or other eukaryotic host cells, such as those of yeast, filamentous fungi, 
plant, insect, or amphibian or avian species, may also be useful for production of the proteins 
of the present invention. Propagation of mammalian cells in culture is per se well known. 
See, Jakoby and Pastan (eds.), Cell Culture. Methods in Enzymology, volume 58, Academic 
Press, Inc., Harcourt Brace Jovanovich, NY, (1979). Examples of commonly used 
mammalian host cell lines are VERO and HeLa cells, Chinese hamster ovary (CHO) cells, 
and WI38, BHK, and COS cell lines, although it will be appreciated by the skilled 
practitioner that other cell lines may be appropriate, e.g., to provide higher expression, 
desirable glycosylation patterns, or other features. 

Clones are selected by using markers depending on the mode of the vector 
construction. The marker may be on the same or a different DNA molecule, preferably the 
same DNA molecule. In prokaryotic hosts, the transformant may be selected, e.g., by 
resistance to ampicillin, tetracycline or other antibiotics. Production of a particular product 
based on temperature sensitivity may also serve as an appropriate marker. 

Prokaryotic or eukaryotic cells transformed with the nucleic acids of the present 
invention will be useful not only for the production of the nucleic acids and proteins of the 
present invention, but also, for example, in studying the characteristics of Zmax1 (LRP5), 
LRP6, HBM or HBM-like proteins. 

Antisense nucleic acid sequences are useful in preventing or diminishing the 
expression of Zmax1 (LRP5), LRP6, HBM or HBM-like molecules, as will be appreciated by 
one skilled in the art. For example, nucleic acid vectors containing all or a portion of the 
Zmax1 (LRP5), LRP6, HBM or HBM-like nucleic acid may be placed under the control of a 
promoter in an antisense orientation and introduced into a cell. Expression of such an 
antisense construct within a cell will interfere with Zmax1 (LRP5), LRP6, HBM or HBM-like 
transcription and/or translation and/or replication. 
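
As a rough illustration of what "antisense orientation" means at the sequence level, the sketch below computes the reverse complement of a short sense-strand fragment. The function name `reverse_complement` is hypothetical, and the example fragment (the bases of SEQ ID NO:699 starting at the initiator ATG) is used purely for illustration; it is not a construct described in this specification.

```python
# Complement table for unambiguous DNA bases.
# Assumption: the input contains only A, C, G and T.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


# Illustrative sense-strand fragment, written 5'->3'.
sense = "ATGGAGGCAGCGCCGC"
antisense = reverse_complement(sense)
print(antisense)
```

An insert cloned so that the promoter reads this reverse-complemented strand would be transcribed into RNA complementary to the sense transcript.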

The probes and primers based on the Zmax1 (LRP5), LRP6, HBM or HBM-like gene 
sequences disclosed herein are used to identify homologous gene sequences and proteins in 
other species. The gene sequences and proteins can also be used in the diagnostic/prognostic, 
therapeutic and drug screening methods described herein for the species from which they 
have been isolated. 

19. Protein Expression and Purification 

Expression and purification of the Zmax1 (LRP5), LRP6, HBM or HBM-like proteins 
of the invention can be performed essentially as outlined below (LRP5, LRP6 and HBM-like 
proteins are also included when referring only to HBM). To facilitate the cloning, expression 
and purification of membrane and secreted protein from the HBM gene, a gene expression 
system, such as the pET System (Novagen), for cloning and expression of recombinant 
proteins in E. coli was selected. Also, a DNA sequence encoding a peptide tag, the His-Tag, 
was fused to the 3' end of DNA sequences of interest to facilitate purification of the 
recombinant protein products. The 3' end was selected for fusion to avoid alteration of any 5' 
terminal signal sequence. 

Nucleic acids chosen, for example, from the nucleic acids set forth in SEQ ID NOS: 
1, 3 and 5-12 for cloning HBM were prepared by polymerase chain reaction (PCR). 
Synthetic oligonucleotide primers specific for the 5' and 3' ends of the HBM nucleotide 
sequence were designed and purchased from Life Technologies (Gaithersburg, MD). All 
forward primers (specific for the 5' end of the sequence) were designed to include an NcoI 
cloning site at the 5' terminus. These primers were designed to permit initiation of protein 
translation at the methionine residue encoded within the NcoI site followed by a valine 
residue and the protein encoded by the HBM DNA sequence. All reverse primers (specific 
for the 3' end of the sequence) included an EcoRI site at the 5' terminus to permit cloning of 
the HBM sequence into the reading frame of the pET-28b. The pET-28b vector provided a 
sequence encoding an additional 20 carboxyl-terminal amino acids including six histidine 
residues (at the C-terminus), which comprised the histidine affinity tag. 

Genomic DNA prepared from the HBM gene was used as the source of template DNA 
for PCR amplification (Ausubel et al., Current Protocols in Molecular Biology, John Wiley 
& Sons (1994)). To amplify a DNA sequence containing the HBM nucleotide sequence, 
genomic DNA (50 ng) was introduced into a reaction vial containing 2 mM MgCl2, 1 µM 
synthetic oligonucleotide primers (forward and reverse primers) complementary to and 
flanking a defined HBM, 0.2 mM of each deoxynucleotide triphosphate (dATP, dGTP, 
dCTP and dTTP) and 2.5 units of heat stable DNA polymerase (AmpliTaq™, Roche Molecular 
Systems, Inc., Branchburg, NJ) in a final volume of 100 microliters. 
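
The final concentrations stated above can be turned into pipetting volumes with the usual dilution relation C(stock) x V(stock) = C(final) x V(final). The following is a minimal sketch only: the final concentrations and the 100 µl volume come from the text, whereas every stock concentration and the helper name `stock_volume_ul` are hypothetical values chosen for illustration.

```python
import math


def stock_volume_ul(final_conc: float, stock_conc: float, final_volume_ul: float) -> float:
    """Volume of stock (in µl) satisfying C_stock * V_stock = C_final * V_final.

    final_conc and stock_conc must be expressed in the same unit.
    """
    if stock_conc <= 0 or final_conc > stock_conc:
        raise ValueError("stock must be positive and at least as concentrated as the target")
    return final_conc * final_volume_ul / stock_conc


FINAL_VOLUME_UL = 100.0  # final reaction volume stated in the text

# name: (final concentration from the text, HYPOTHETICAL stock concentration)
# Units match within each pair.
reagents = {
    "MgCl2 (mM)": (2.0, 25.0),
    "forward primer (µM)": (1.0, 10.0),
    "reverse primer (µM)": (1.0, 10.0),
    "each dNTP (mM)": (0.2, 10.0),
}

volumes = {
    name: stock_volume_ul(final, stock, FINAL_VOLUME_UL)
    for name, (final, stock) in reagents.items()
}

for name, vol in volumes.items():
    print(f"{name}: {vol:.1f} µl of stock")
```

With different stock solutions only the second number in each pair changes; the remaining volume is made up with buffer, template, polymerase and water.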

Upon completion of thermal cycling reactions, each sample of amplified DNA was 
purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, MD). All 
amplified DNA samples were subjected to digestion with the restriction endonucleases, e.g., 
NcoI and EcoRI (New England BioLabs, Beverly, MA) (Ausubel et al., Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc. (1994)). DNA samples were then subjected to 
electrophoresis on 1.0% NuSieve (FMC BioProducts, Rockland, ME) agarose gels. DNA 
was visualized by exposure to ethidium bromide and long wave UV irradiation. DNA 
contained in slices isolated from the agarose gel was purified using the Bio 101 GeneClean 
Kit protocol (Bio 101, Vista, CA). 

The pET-28b vector was prepared for cloning by digestion with restriction 
endonucleases, e.g., NcoI and EcoRI (New England BioLabs, Beverly, MA) (Ausubel et al., 
Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994)). The pET-28a 
vector, which encodes the histidine affinity tag that can be fused to the 5' end of an inserted 
gene, was prepared by digestion with appropriate restriction endonucleases. 

Following digestion, DNA inserts were cloned (Ausubel et al., Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc. (1994)) into the previously digested pET-28b 
expression vector. Products of the ligation reaction were then used to transform the BL21 
strain of E. coli (Ausubel et al., 1994) as described below. 

Competent bacteria, E. coli strain BL21 or E. coli strain BL21(DE3), were 
transformed with recombinant pET expression plasmids carrying the cloned HBM sequence 
according to standard methods (Ausubel et al., 1994). Briefly, 1 µl of ligation reaction was 
mixed with 50 µl of electrocompetent cells and subjected to a high voltage pulse, after which 
samples were incubated in 0.45 ml SOC medium (0.5% yeast extract, 2.0% tryptone, 10 mM 
NaCl, 2.5 mM KCl, 10 mM MgCl2, 10 mM MgSO4 and 20 mM glucose) at 37 °C with 
shaking for 1 hour. Samples were then spread on LB agar plates containing 25 µg/ml 
kanamycin sulfate for growth overnight. Transformed colonies of BL21 were then picked 
and analyzed to evaluate cloned inserts, as described below. 

Individual BL21 clones transformed with recombinant pET-28b HBM nucleotide 
sequences were analyzed by PCR amplification of the cloned inserts using the same forward 
and reverse primers specific for the HBM sequences that were used in the original PCR 
amplification cloning reactions. Successful amplification verifies the integration of the HBM 
sequence in the expression vector (Ausubel et al., 1994). 

Individual clones of recombinant pET-28b vectors carrying properly cloned HBM 
nucleotide sequences were picked and incubated in 5 ml of LB broth plus 25 µg/ml 
kanamycin sulfate overnight. The following day plasmid DNA was isolated and purified 
using the Qiagen plasmid purification protocol (Qiagen Inc., Chatsworth, CA). 

The pET vector can be propagated in any E. coli K-12 strain, e.g., HMS174, HB101, 
JM109, DH5 and the like, for purposes of cloning or plasmid preparation. Hosts for 
expression include E. coli strains containing a chromosomal copy of the gene for T7 RNA 
polymerase. These hosts are lysogens of bacteriophage DE3, a lambda derivative that 
carries the lacI gene, the lacUV5 promoter and the gene for T7 RNA polymerase. T7 RNA 
polymerase is induced by addition of isopropyl-β-D-thiogalactoside (IPTG), and the T7 
RNA polymerase transcribes any target plasmid containing a functional T7 promoter, such as 
pET-28b, carrying its gene of interest. Strains include, for example, BL21(DE3) (Studier et 
al., 1990 Meth. Enzymol. 185: 60-89). 

To express the recombinant HBM sequence, 50 ng of plasmid DNA, isolated as 
described above, was used to transform competent BL21(DE3) bacteria (provided 
by Novagen as part of the pET expression kit) as described above. The lacZ gene 
(β-galactosidase) is expressed in the pET System as described for the HBM recombinant 
constructions. Transformed cells were cultured in SOC medium for 1 hour, and the culture 
was then plated on LB plates containing 25 µg/ml kanamycin sulfate. The following day, the 
bacterial colonies were pooled and grown in LB medium containing kanamycin sulfate 
(25 µg/ml) to an optical density at 600 nm of 0.5 to 1.0 O.D. units, at which point 1 mM IPTG 
was added to the culture for 3 hours to induce gene expression of the HBM recombinant DNA 
constructions. 

After induction of gene expression with IPTG, bacteria were collected by 
centrifugation in a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4 °C. Pellets were 
resuspended in 50 ml of cold mM Tris-HCl, pH 8.0, 0.1 M NaCl and 0.1 mM EDTA (STE 
buffer). Cells were then centrifuged at 2000 x g for 20 minutes at 4 °C. Wet pellets were 
weighed and frozen at -80 °C until ready for protein purification. 

19.1 Chinese Hamster Ovary (CHO) Expression System 
Alternatively, HBM and Zmax1 (LRP5) may be expressed in eukaryotic cells. 
Eukaryotic cells, such as mammalian derived cell lines, are more capable of expressing 
properly folded proteins containing cysteine-rich domains such as the EGF and LDLR 
modules. 

19.2 Development of Constructs 

HBM and Zmax1 (LRP5) extracellular domain (ECD) fusions to IgG-Fc were 
prepared. These ECD fusions to the IgG-Fc domain remove the endogenous transmembrane 
and cytoplasmic portion of the Zmax1/HBM receptor and should produce a secreted fusion 
protein. The Fc region is separated from the Zmax1/HBM ECD by an enterokinase 
recognition site so that purified Zmax1 or HBM ECD protein can be obtained without the Fc 
domain present. The vector used for this construct was pHTop, a derivative of pED 
(Kaufman et al., 1991 Nuc. Acids Res. 19: 4485-90) in which the majority of the adeno major 
late promoter was replaced by six repeats of the tet operator (Gossen et al., 1992 Proc. Natl. 
Acad. Sci. USA 89: 5547-51). This vector contains the dihydrofolate reductase (dhfr) gene, 
and when introduced in the cell line CHO/A2 (see description below), functions very 
efficiently. Clones with high expression can be selected by isolating cells which survive in 
high methotrexate (MTX) concentrations. 

The CHO expression vector pHTOP-Fc was digested with SalI and NotI. The 
intervening sequence was purified away from the rest of the vector by electroelution from an 
acrylamide gel slice. SalI cuts 5' to the intrinsic honey bee mellitin signal sequence in 
pHTOP-Fc, and NotI cuts just 5' to the coding sequence of IgG1-Fc. The resulting SalI-NotI 
pHTOP-Fc vector has the signal sequence removed and the NotI cloning site is amenable to 
creating a 5' fusion to IgG-Fc. Full-length Zmax1 (LRP5) in pCMVSPORT6 and full-length 
HBM in pCMVSPORT6 were digested individually with XmaI (which cuts within the region 
of the ORF that encodes the signal sequence) and BamHI (which cuts internally in the ORF) to 
generate a 2286 bp 5' fragment of Zmax1 and HBM. The mutation which distinguishes 
Zmax1 from HBM lies on this fragment. Separately, the Zmax1 DNA was digested with 
BamHI and SacI to isolate an 1800 bp 3' fragment which is common to both the Zmax1 and 
the HBM genes. Together, these two fragments constitute the coding sequence for the HBM 
and Zmax1 extracellular domains, less the coding sequence for the first 6 amino acids of the 
signal sequence and ending 18 amino acids prior to the end of the extracellular domain, 
which we estimated from Kyte-Doolittle plots to end at the sequence "SPAHSS" (SEQ ID 
NO:698). 

A synthetic duplex was designed to recreate the coding sequence of the Zmax1/HBM 
signal sequence 5' of the native XmaI site, which included the initiator methionine and Kozak 
sequence. This duplex was designed to contain SalI (5') and XmaI (3') cohesive ends to adapt 
the gene fragments described above to the pHTOP-Fc vector. This synthetic 
duplex was constructed from two partially complementary oligonucleotides as given below: 
5'-TCGACCACCATGGAGGCAGCGCCGC-3' (SEQ ID NO:699) 
3'-GGTGGTACCTCCGTCGCGGCGGGCC-5' (SEQ ID NO:700) 
A second synthetic duplex was designed to recreate the 3' coding sequence from a native SacI 
site to the estimated end of the extracellular domain following the serine in the sequence 
"...SPAHSS", and to also encode a cloning site to allow in-frame fusion to the downstream 
IgG-Fc. This duplex was designed to contain SacI (5') and NotI (3') cohesive ends to adapt the 
gene fragments described above to the pHTOP-Fc vector. This synthetic duplex was 
constructed from two partially complementary oligonucleotides whose sequences are given 
below: 

5'-CATGTGTGAAATCACCAAGCCGCCCTCAGACGACAGCCCGGCCCACAGCAGTGGC-3' (SEQ ID NO:701) 

3'-TCGAGTACACACTTTAGTGGTTCGGCGGGAGTCTGCTGTCGGGCCGGGTGTCGTCACCGCCGG-5' (SEQ ID NO:702) 
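
The statement that each pair of oligonucleotides is "partially complementary" can be checked mechanically. The sketch below is an illustration under stated assumptions: the four sequences are transcribed from the listing above with line breaks and spaces removed, the bottom strands are kept in the 3'->5' orientation in which they are printed, and the four-base offsets reflect one reading of where the single-stranded cohesive ends lie. The helper name `anneals` is hypothetical.

```python
# Complement table for unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def anneals(top_5to3: str, bottom_3to5: str, top_skip: int, bottom_skip: int) -> bool:
    """Check base pairing of two strands written in antiparallel register.

    top_5to3 is read 5'->3' and bottom_3to5 is read 3'->5', so position i of
    one strand faces position i of the other once the unpaired overhang bases
    at the left end (top_skip, bottom_skip) are set aside. Any overhang at the
    right end is ignored by comparing only the overlapping length.
    """
    top = top_5to3[top_skip:]
    bottom = bottom_3to5[bottom_skip:]
    n = min(len(top), len(bottom))
    return n > 0 and top[:n].translate(_COMPLEMENT) == bottom[:n]


SEQ699 = "TCGACCACCATGGAGGCAGCGCCGC"                                # 5'->3'
SEQ700 = "GGTGGTACCTCCGTCGCGGCGGGCC"                                # 3'->5'
SEQ701 = "CATGTGTGAAATCACCAAGCCGCCCTCAGACGACAGCCCGGCCCACAGCAGTGGC"  # 5'->3'
SEQ702 = ("TCGAGTACACACTTTAGTGGTTCGGCGGGAGTCTGCTGTC"
          "GGGCCGGGTGTCGTCACCGCCGG")                                # 3'->5'

# First duplex: the top strand carries a 4-base 5' overhang (TCGA) on the left.
duplex1_ok = anneals(SEQ699, SEQ700, top_skip=4, bottom_skip=0)
# Second duplex: the bottom strand carries a 4-base 3' overhang on the left.
duplex2_ok = anneals(SEQ701, SEQ702, top_skip=0, bottom_skip=4)

print(duplex1_ok, duplex2_ok)
```

A check of this kind is a convenient way to catch transcription errors in oligonucleotide sequences before synthesis.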

The fragments, synthetic duplexes, and vector were ligated together in a single 
reaction. Separate reactions were performed for Zmax1 (LRP5) and HBM. The ligation 
mixtures were used to transform electrocompetent E. coli DH10B cells, and the resulting 
colonies were screened by radioactive colony hybridization using the common SacI-BamHI 3' 
fragment as a probe. Colonies containing plasmids with the Zmax1 or HBM fragment 
inserted were identified, and plasmids were isolated from multiple candidates and their 
sequences were verified by DNA sequencing. Verified constructs were then used for 
transfection into CHO cells. 

19.2.1 Establishment of CHO stable cell lines 

The CHO/A2 cell line is derived from CHO DUKX B11 (Urlaub et al., 1980 Proc. 
Natl. Acad. Sci. USA 77: 4216-20) by stably integrating a transcriptional activator (tTA), a 
fusion protein between the Tet repressor and the herpes virus VP16 transcriptional domain 
(Gossen et al.). CHO cell lines expressing extracellular HBM-1.Fc and Zmax1.Fc were 
established by transfecting (using lipofection) pHTopHBM-1.Fc into CHO/A2 cells and 
pHTopZmax1.Fc into CHO/A2 cells. Clones were selected by culturing the cells in 
0.02 µM methotrexate. Clones were later amplified step-wise to a final concentration of 
0.5 µM methotrexate. 

19.2.2 Screening of CHO stable cell lines 

Multiple clones were screened by a variety of techniques. Clones were screened by 
Western blot assay using a (mouse) anti-human IgG.Fc horseradish peroxidase (HRP) 
antibody. The same clones were also metabolically labeled with 35S-Met/Cys for a 6 hour 
pulse, or a 15 minute pulse, followed by a 1 hour, 4 hour, or 24 hour chase in media without 
radiolabeled Met/Cys. Immunoprecipitations were performed on proteins obtained from 
conditioned media, as well as from cell extracts. Purification is then performed followed by 
sequencing of the proteins using N-terminal sequencing as known in the art. 

19.2.3 Fusion Protein Purification 

Zmax1-IgG or HBM-IgG fusion protein can be purified from conditioned media or 
cell extracts of CHO stable cells. The fusion protein is isolated by affinity binding to protein 
A (for example using protein A coated beads or columns). The IgG-Fc domain can then 
subsequently be cleaved from the Zmax1/HBM ECD protein by enterokinase digestion. 

19.2.4 Potential uses for cell lines and protein 

Stable cell lines may be used for generation of purified protein for use in ligand 
hunting, antibody generation, determination of crystal structure, and competitive binding 
assays. 

A variety of methodologies known in the art can be used to purify the isolated 
proteins (Coligan et al., Current Protocols in Protein Science, John Wiley & Sons (1995)). 
For example, the frozen cells can be thawed, resuspended in buffer and ruptured by several 
passages through a small volume microfluidizer (Model M-110S, Microfluidics International 
Corp., Newton, MA). The resultant homogenate is centrifuged to yield a clear supernatant 
(crude extract) and, following filtration, the crude extract is fractionated over columns. 
Fractions are monitored by absorbance at 280 nm and peak fractions may be analyzed by 
SDS-PAGE. 

The concentrations of purified protein preparations are quantified 
spectrophotometrically using absorbance coefficients calculated from amino acid content 
(Perkins, 1986 Eur. J. Biochem. 157: 169-180). Protein concentrations are also measured by 
the method of Bradford, 1976 Anal. Biochem. 72: 248-54 and Lowry et al., 1951 J. Biol. 
Chem. 193: 265-275 using bovine serum albumin as a standard. 
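
To make the spectrophotometric step concrete, the following sketch estimates a molar absorbance coefficient at 280 nm from amino acid content and applies the Beer-Lambert law, A = epsilon x c x l. It is an illustration only. The per-residue values used (5500 for Trp, 1490 for Tyr and 125 per cystine, in M^-1 cm^-1) are widely used approximations and are an assumption here; they are not necessarily the coefficients of the Perkins reference cited above. The function names, the example absorbance reading and the 1 cm path length are likewise hypothetical.

```python
def molar_extinction_280(seq: str, cystines: int = 0) -> int:
    """Approximate molar absorbance coefficient at 280 nm (M^-1 cm^-1).

    Assumed per-residue contributions: Trp 5500, Tyr 1490, cystine 125.
    `cystines` is the number of disulfide bonds, supplied by the caller.
    """
    seq = seq.upper()
    return 5500 * seq.count("W") + 1490 * seq.count("Y") + 125 * cystines


def molar_concentration(a280: float, epsilon: float, path_cm: float = 1.0) -> float:
    """Beer-Lambert law solved for concentration: c = A / (epsilon * l)."""
    if epsilon <= 0 or path_cm <= 0:
        raise ValueError("epsilon and path length must be positive")
    return a280 / (epsilon * path_cm)


# Example input: the 1556-1573 peptide listed in Table 11 of this document.
peptide = "RWKASKYYLDLNSDSDPY"
epsilon = molar_extinction_280(peptide)
# 0.5 is a hypothetical absorbance reading, not a measured value.
concentration_molar = molar_concentration(0.5, epsilon)
print(epsilon, concentration_molar)
```

A sequence with no Trp, Tyr or cystine gives a coefficient of zero in this approximation, in which case a dye-binding method such as the Bradford assay mentioned above is the more appropriate choice.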

SDS-polyacrylamide gels of various concentrations were purchased from BioRad 
(Hercules, CA), and stained with Coomassie blue. Molecular weight markers may include 
rabbit skeletal muscle myosin (200 kDa), E. coli β-galactosidase (116 kDa), rabbit muscle 
phosphorylase B (97.4 kDa), bovine serum albumin (66.2 kDa), ovalbumin (45 kDa), bovine 
carbonic anhydrase (31 kDa), soybean trypsin inhibitor (21.5 kDa), egg white lysozyme (14.4 
kDa) and bovine aprotinin (6.5 kDa). 

Once a sufficient quantity of the desired protein has been obtained, it may be used for 
various purposes. A typical use is the production of antibodies specific for binding. These 
antibodies may be either polyclonal or monoclonal, and may be produced by in vitro or in 
vivo techniques well known in the art. Monoclonal antibodies to epitopes of any of the 
peptides identified and isolated as described can be prepared from murine hybridomas 
(Kohler, 1975 Nature 256: 495). In summary, a mouse is inoculated with a few micrograms 
of HBM protein over a period of two weeks. The mouse is then sacrificed. The cells that 
produce antibodies are then removed from the mouse's spleen. The spleen cells are then 
fused with mouse myeloma cells using polyethylene glycol. The successfully fused cells are 
diluted in a microtiter plate and growth of the culture is continued. The amount of antibody 
per well is measured by immunoassay methods such as ELISA (Engvall, 1980 Meth. 
Enzymol. 70: 419). Clones producing antibody can be expanded and further propagated to 
produce HBM antibodies. Other suitable techniques involve in vitro exposure of 
lymphocytes to the antigenic polypeptides, or alternatively, selection of libraries of 
antibodies in phage or similar vectors. See Huse et al., 1989 Science 246: 1275-81. For 
additional information on antibody production see Davis et al., Basic Methods in Molecular 
Biology, Elsevier, NY, Section 21-2 (1989). 

Additional uses for purified or isolated protein include use in X-ray crystallography, 
binding assays, and so forth as described in greater detail infra. 

19.3 Zmax1, LRP6, and Variant Antibodies 

Polyclonal antibodies were developed to both human Zmax1 (LRP5) (SEQ ID NO:3) 
and LRP6 (GenBank Accession No. AF074264). Antibodies can similarly be prepared 
against HBM and HBM-like proteins and polypeptides. Peptides from the Zmax1 amino acid 
sequence were selected as immunogens based on five goals. 1) Maximize differences 
between Zmax1 and LRP6 amino acid sequences (71% amino acid identity). See Fig. 27. 
For sequence comparison, typically one sequence acts as a reference sequence, to which test 
sequences are compared. When using a sequence comparison algorithm, test and reference 
sequences are input into a computer, subsequence coordinates are designated, if necessary, 
and sequence algorithm program parameters are designated. The sequence comparison 
algorithm then calculates the percent sequence identity for the test sequence(s), relative to the 
reference sequence, based on the designated program parameters. 2) Minimize potential 
cross reactivity with other known genes by performing sequence alignment and similarity 
searches. Optimal alignment of sequences for comparison can be conducted, e.g., by the 
local homology algorithm of Smith et al., 1981 Adv. Appl. Math. 2: 482, by the homology 
alignment algorithm of Needleman et al., 1970 J. Mol. Biol. 48: 443, by the search for 
similarity method of Pearson et al., 1988 Proc. Natl. Acad. Sci. USA 85: 2444, by 
computerized implementations of these algorithms and others in programs contained in the 
Wisconsin Genetics software package, Genetics Computer Group, 575 Science Dr., Madison, 
WI, or by visual inspection (see generally Ausubel et al., Current Protocols in Molecular 
Biology, John Wiley & Sons (1997)). Another example of an algorithm that is suitable for 
determining percent sequence identity and sequence similarity is the BLAST algorithm, 
which is described in Altschul et al., 1990 J. Mol. Biol. 215: 403-10. Software for 
performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information. 3) Obtain peptides with the highest possible antigenicity index 
as determined by PeptideStructure protein analysis using software programs contained in the 
Wisconsin Genetics software package, Genetics Computer Group, 575 Science Dr., Madison, 
WI. 4) Locate peptides relative to highly homologous domains (e.g., EGF-like domains 
and LDL receptor repeats) within the gene family and relative to the extracellular 
and cytoplasmic regions of the gene. 5) For human Zmax1 specific antibodies, the 
human amino acid sequence (SEQ ID NO: 3) was compared to the mouse Zmax1 sequence 
(GenBank Accession No. AF064984) and peptides were selected based on the above criteria 
in addition to minimizing the sequence similarity between the two species (see Figure 26). 
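
The percent identity calculation referred to in goal 1 can be sketched in a few lines once two sequences are aligned. The example below is a simplification and not the procedure used with the alignment programs named above: it assumes the two inputs are already aligned position by position with no gaps. The function name `percent_identity` is hypothetical, and the two peptides (Zmax1 residues 901-915 and LRP6 residues 888-902, both listed in Table 11) are paired here purely for illustration.

```python
def percent_identity(reference: str, test: str) -> float:
    """Percent of positions with identical residues in a gapless alignment.

    Assumption: both sequences have the same length and are already aligned.
    """
    if not reference or len(reference) != len(test):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for r, t in zip(reference, test) if r == t)
    return 100.0 * matches / len(reference)


zmax1_peptide = "DGLNDCMHNNGQCGQ"  # Zmax1 (LRP5) residues 901-915, Table 11
lrp6_peptide = "SGWNECASSNGHCSH"   # LRP6 residues 888-902, Table 11

identity = percent_identity(zmax1_peptide, lrp6_peptide)
print(f"{identity:.1f}% identity")
```

Alignment programs additionally place gaps and score substitutions before reporting identity, so their values over full-length sequences are not directly comparable with this gapless peptide comparison.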

Using the same criteria above, LRP6 specific peptides were selected for polyclonal 
antibody production. Table 11 lists the amino acid sequences that were chosen and the number 
of amino acid differences within each peptide between the human and mouse sequences. All 
the peptide sequences (ranging from 12-18 amino acids) were provided to Sigma/Genosys 
(St. Louis, MO) for peptide synthesis and subsequent polyclonal antibody production in New 
Zealand White Rabbits. The IgG fraction from the serum of each immunized rabbit was 
isolated using Protein G Sepharose (Amersham). Polyclonal antibody generation using these 
peptides may be done in other species as well, for example, chickens. This is often 
advantageous when there is a high degree of similarity between the human (reference) and 
murine/rodent sequence. 



TABLE 11 

Amino Acids   Sequence             SEQ ID NO   H/M* Differences   Comments 
171-187       VETPREERAGMDGSTRK    703         5                  Contains HBM polymorphism 
264-278       NKRTGGKRKEILSAL      704         3                  Extracellular 
290-301       ERQPFFHTRCEE         705         2                  Adjacent to EGF-I, extracellular 
532-546       VDGTKRRTLLEDKLP      706         5                  Extracellular 
901-915       DGLNDCMHNNGQCGQ      707         2                  In EGF-III, extracellular 
1010-1021     PFVLTSLSQGQN         708         6                  Extracellular, human specific 
1415-1429     YAGANGPFPHEYVSG      709         3                  Cytoplasmic 
1452-1464     ACGKSMMSSVSLM        710         5                  Cytoplasmic, human specific 
1556-1573     RWKASKYYLDLNSDSDPY   711         1                  Cytoplasmic 
888-902       SGWNECASSNGHCSH      712                            LRP6 specific 
1308-1321     NGDANCQDKSDEKN       713                            LRP6 specific 

* H/M = differences between human and mouse sequences 



Antibodies towards variants of LRP6, HBM and LRP5 are also contemplated. Based 
on the analysis of the structural model of the LRP5 beta propeller 1 (discussed in more detail 
in the Examples), interior regions of the propeller were analysed. Since the site-directed 
mutagenesis experiments had confirmed that modulation of propeller 1, particularly in the 
interior regions of beta propeller 1, could result in an HBM effect, a strategy was employed to 
generate antibodies with epitopes specific to these regions. Such antibodies to the wild type 
LRP5 receptor could serve, for example, as an HBM mimetic, by altering ligand/receptor 
interactions, protein-protein interactions, or by modulating Wnt or Dkk activity. Such 
antibodies could be used as therapies to treat, for example, osteoporosis. 
One set of preferred antibodies includes: 

LRP5 Amino Acids   Sequence 
208-223            KLYWADAKLSFIHRAN 
277-291            ALYSPMDIQVLSQER 
61-73              GLEDAAAVDFQFSKGA 
234-247            EGSLTHPFALTLSG 
249-264            TLYWTDWQTRSIHACN 
144-156            VLFWQDLDQPRAI 
194-210            IYWPNGLTIDLEEQKLY 
34-47              LLLFANRRDVRLVD 
75-89              GAVYWTDVSEEAIKQ 
121-135            KLYWTDSETNRIEVA 

Similar peptides can be used to prepare antibodies based on the LRP6 or HBM structures 
and the proposed mutations in their propellers. Other antibodies could easily be prepared 
based on the structural model discussed in the Examples, as would be readily appreciated 
by the skilled artisan. 

Another series of antibodies could be prepared which target the domains on HBM, 

LRP5, LRP6 and their variants which interact Dkk. Such antibodies could again serve as 

HBM mimetics, for example by displacing Dkk binding and thereby could be used as an 

osteoporosis therapeutic. Preferred peptides for preparing antibodies include but are not 

limited to: 

LRP5 Amino Acids    Sequence
969-993             LILPLHGLRNVKAIDYDPLDKFIYW
989-1013            KFIYWVDGRQNIKRAKDDGTQPFVL
1009-1033           QPFVLTSLSQGQNPDRQPHDLSIDI
1029-1053           LSIDIYSRTLFWTCEATNTINVHRL
1049-1073           NVHRLSGEAMGVVLRGDRDKPRAIV
1253-1266           CGEPPTCSPDQFAC
1278-1295           WRCDGFPECDDQSDEEGC
1316-1332           RCDGEADCQDRSDEADC
1370-1383           CEITKPPSDDSPAH

Additional peptides would be readily apparent to the artisan of ordinary skill. 
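
The first five peptides in the list above are 25 residues long and each overlaps its neighbor by five residues. A minimal Python sketch of such a tiling follows; the function name `tile_peptides` and the window and step defaults are illustrative assumptions and are not part of the disclosure. The example region is assembled only from the 989-1013 and 1009-1033 peptides as listed.

```python
def tile_peptides(sequence, start, length=25, step=20):
    """Yield (first_residue, last_residue, peptide) windows over `sequence`.

    `start` is the residue number of sequence[0]. Consecutive windows
    overlap by `length - step` residues (5 with the defaults used here).
    """
    for offset in range(0, len(sequence) - length + 1, step):
        first = start + offset
        yield first, first + length - 1, sequence[offset:offset + length]


# Residues 989-1033 of LRP5, joined from two overlapping peptides in the list.
region = (
    "KFIYWVDGRQNIKRAKDDGT"        # residues 989-1008
    "QPFVLTSLSQGQNPDRQPHDLSIDI"   # residues 1009-1033
)

tiles = list(tile_peptides(region, start=989))
for first, last, peptide in tiles:
    print(f"{first}-{last}  {peptide}")
```

Changing `length` or `step` gives other window sizes and overlaps; the values above were chosen only to reproduce the spacing seen in the list.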



19.3.1 Single chain Fv molecules developed by phage display
Peptides were chosen from the Zmax1 (LRP5) sequence (SEQ ID NO: 3) to screen
for single chain Fv (scFv) molecules by phage display. Similar peptides can be chosen for
LRP6, HBM and HBM-like proteins to screen for scFv molecules. As discussed below, all
mention of LRP5 is also meant to include these other proteins.

A total of 17 peptides from the Zmax1 sequence were selected for synthesis and
subsequent phage display screening for scFv molecules. All peptide synthesis and phage
display work was performed at Cambridge Antibody Technology (CaT) in Cambridge, UK.
Peptides were selected based on the criteria described above.



TABLE 12

Protein Domain               Zmax1 Residues    LRP6 Residues    % Identity
Spacer 1 (+G171V)            161-181           148-168          76%
Spacer 1 (-G171V)            161-181           148-168          76%
EGF1                         301-321           288-308          76%
Spacer 2                     401-421           388-408          52%
EGF2                         611-631           598-618          62%
Spacer 3                     781-801           768-788          62%
EGF3                         921-941           908-928          10%
Spacer 4                     1000-1021         988-1008         26%
EGF4                         1229-1249         1219-1239        76%
LDLR1                        1261-1282         1252-1272        81%
LDLR2                        1300-1320         1290-1310        57%
LDLR3                        1338-1358         1328-1348        48%
Cytosolic 1                  1418-1438         1405-1425        14%
Cytosolic 1                  1516-1536         1503-1525        52%
Cytosolic 1                  1535-1555         1524-1544        81%
Cytosolic 1                  1595-1615         1592-1613        82%
Spacer 2 (cross reactive)    421-441           408-428          100%



Note that a number of these regions (e.g., 401-421, 421-441, 781-801, and 1229-1249)
share 100% identity with mouse LRP5 (see Fig. 26). Therefore, these may be used
against both mouse and human forms of the protein. The peptide 421-441 was included to
facilitate the generation of an antibody that would recognize both Zmax1 (LRP5) and LRP6
(see Figure 27). Two peptides were synthesized spanning the HBM mutation site (Zmax1
residues 161-181), one with the Zmax1 sequence and the other containing the HBM
sequence.
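
The text does not state how the identity percentages in Table 12 were computed. Assuming an ungapped, position-by-position comparison of equal-length peptides, 16 matching positions out of 21 would round to the 76% reported for several rows. The sketch below illustrates that arithmetic; the function name and the two toy peptides are hypothetical and are not sequences from the disclosure.

```python
def percent_identity(peptide_a, peptide_b):
    """Ungapped percent identity between two equal-length peptides."""
    if len(peptide_a) != len(peptide_b):
        raise ValueError("peptides must have the same length")
    matches = sum(a == b for a, b in zip(peptide_a, peptide_b))
    return 100.0 * matches / len(peptide_a)


# Hypothetical 21-residue peptides differing at 5 positions (16/21 identical).
toy_zmax1 = "A" * 21
toy_lrp6 = "A" * 16 + "G" * 5

identity = percent_identity(toy_zmax1, toy_lrp6)
print(f"{identity:.0f}% identity")
```

Peptide pairs of unequal length, such as those rows of Table 12 whose residue ranges differ in size, would need a gapped alignment, which this sketch does not attempt.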

Once scFv molecules were isolated, they were used as reagents in immunochemistry
to detect Zmax1 (LRP5) protein expression in a variety of normal and diseased human
tissues. Immunohistochemical analysis with three scFv phage clones raised
against peptide 1000-1021 (i.e., IKRAKDDGTQPFVLTSLSQGQN; SEQ ID NO:
714) of the extracellular domain of Zmax1 showed positive staining of cardiac muscle,
kidney, lung and liver. Expression was also detected in prostate carcinoma. These results
are consistent with mRNA tissue distribution profiles as well as with the published reports
of LRP5 mRNA localization (Kim et al., 1998 J. Biochem. 124: 1072-6). The resulting
phage clones arise from pools and will be sequenced to identify potential variants in the Fv
region of the molecules. Once identified, the suitable scFvs can then be subcloned into
variable heavy chain and variable light chain DNA constructs for cotransfection into COS
cells for final assembly of an intact and functional immunoglobulin gamma (IgG) molecule.
The IgG that is expressed by the cells can then be further characterized for specificity and
reactivity as would be known in the art.

19.3.2 Monoclonal antibody development 

Monoclonal antibodies can be developed to Zmax1 (LRP5), LRP6, HBM and HBM-like
proteins/polypeptides by complete cell and adenovirus immunization of, for example,
Balb/c mice (antibodies to all forms are contemplated even when only a few examples are
discussed in detail). Dendritic cells can be isolated from spleens of Balb/c mice, for
example, and the cells expanded in vitro in the presence of growth factors IL-4 and
GM-CSF. The dendritic cells can then be infected with HBM or Zmax1 adenovirus particles.
The cells are then cultured for 24 hours prior to intravenous injection into Balb/c mice.
Dendritic cells (1 x 10^6 cells/mouse) are injected 2-3 times every 3-4 weeks over a three
month period.

Alternatively, purified HBM and Zmax1 DNAs in, for example, the pcDNA3.1
expression vector, can be coated on colloidal gold particles. These particles can then be
injected subcutaneously into the desired mouse using gene gun technology. Approximately
5 µg cDNA/mouse can be injected. Injections are performed 4-6 times every 2 weeks over
approximately a 3 month period.

Another option is that cells (from any species of animal, but preferably from the Balb/c mouse
strain or from the same, related species as the mouse strain being used, to limit the antigen
response to non-specific protein) over-expressing HBM and Zmax1 (LRP5) and their
respective adenovirus will be injected into the mice every 2-3 weeks for a period of about
1.5 to 3 months, as necessary. The bleeds from the mice can be tested for reactivity with
the native and denatured protein by ELISA (using purified protein or protein-fusions), cell
based ELISA, immunohistochemical staining and Western blotting. Serum samples from
the animals can be screened by FACS (fluorescence-activated cell sorting) using cells
infected with Zmax1 or HBM adenovirus. The spleen cells (antibody producing cells) from
the mice with the strongest reactivity can then be fused with a myeloma to generate the
hybridoma cells. The conditioned media from the hybridomas are then screened to identify
positive cell colonies for subsequent cloning. These cloned cells can then be injected into
the intraperitoneal space in mice for ascites production.

19.3.3 Polyclonal Antibody Applications 

Polyclonal antibodies directed against Zmax1 (LRP5) and LRP6 were developed to
determine the function of these proteins and to analyze their expression pattern and levels in
various tissues, cells or any biological sample. Similar antibodies could be prepared which
distinguish the wild type forms from the HBM and HBM-like variants.

Uses for polyclonal antibodies against Zmax1, HBM, HBM-like polypeptides, and
LRP6 include but are not limited to: analysis of bone cross-sectional mounts, tissue
distribution, evaluation of expression of the protein from bone biopsy samples of
affected/non-affected family members (e.g., bone cell digests, explants of bone marrow
stromal cell cultures), evaluation of protein expression levels in transiently or stably
transfected cells, evaluating protein concentration in tissues, serum or body fluid,
purification of full length or fragments of these proteins for ligand hunting and functional
assay development, identification of ligands or proteins which interact with these proteins,
and elucidation of the signaling pathways of LRP6, Zmax1 (LRP5) and HBM, and related
variants.

For example, Zmax1 was cloned in pcDNA3.1 (Invitrogen, Carlsbad, CA). This construct was
used to generate 35S-labeled in vitro translated (Promega, Madison, WI) Zmax1. Antibodies
3109 and 3110 (10 µg/ml), which are directed against the peptide immunogen
RWKASKYYLDLNSDSDPY (SEQ ID NO: 711), were each combined with 20 µl of the in vitro
translated product in the presence of either 10 µg/ml specific peptide (i.e.,
RWKASKYYLDLNSDSDPY; SEQ ID NO: 711), non-specific peptide (i.e.,
SGWNECASSNGHCSH; SEQ ID NO: 712) or no peptide, and incubated for 1.5 hr at 4°C.
Protein A Sepharose was then added to the samples (previously blocked for about 1.5 hr with
reticulocyte lysate), and the samples were shaken for 1 hour at 4°C. The protein A Sepharose
was washed 3 times with 0.5 ml of PBS. The bound protein was subsequently separated on a
4-12% gradient NuPAGE gel (Invitrogen) according to the manufacturer's instructions. The gel
was dried at 80°C for 30 min and then exposed to Kodak X-OMAT-AR film for 24 to 48 hr.
The specific peptide was observed to significantly compete for the 35S-labeled Zmax1
immunoprecipitated protein with either antibody. The competition was not observed with a
non-specific peptide.

These antibodies can also be used for immunohistochemistry. For example, HBM
transgenic and wild-type mice were sacrificed using CO2 narcosis. Mouse calvariae were
removed intact, and the soft tissues gently dissected. The bones were fixed in 10% phosphate
buffered formalin for 24 hours for further processing and analysis. After fixation, calvariae
were decalcified in TBD-2 decalcifying agent (Shandon, Pittsburgh, PA) for about 7-8 hours
and then dehydrated in graded alcohol. Calvariae were then bisected perpendicular to the
sagittal suture through the central portion of the parietal bones parallel to the lambdoidal and
coronal sutures and embedded in paraffin. Four to six 5 µm thick representative sections
were cut.

For example, the rabbit polyclonal Zmax1/HBM antibodies (i.e., antibodies 3109 and
3110) recognize Zmax1 (LRP5) in both HBM transgenic and wild-type mouse calvariae. An
anti-Zmax1 or anti-HBM antibody can be used to detect Zmax1 or HBM protein in order to
evaluate its abundance and pattern of protein expression. Detection can be facilitated by
coupling (i.e., physically linking) the antibody to a detectable substance. Examples of
detectable substances include various enzymes, prosthetic groups, fluorescent materials,
luminescent materials, bioluminescent materials, and radioactive materials. Examples of
suitable enzymes include horseradish peroxidase, alkaline phosphatase, β-galactosidase, or
acetylcholinesterase; examples of suitable prosthetic group complexes include
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material is
luminol; examples of bioluminescent materials include luciferase, luciferin and aequorin;
and examples of suitable radioactive materials include 125I, 131I, 35S, and 3H. Alternatively, a
secondary antibody can be employed that detects the presence of the primary Zmax1
polyclonal antibody. An example would be an antibody that recognizes all rabbit
immunoglobulins. This secondary antibody could be coupled in an identical manner as
described above to facilitate detection. Controls comprised samples with the avidin
peroxidase, but without antibody. Intensive positive staining of stroma cells and
mesenchymal cells was observed in the suture area. Pre-osteoblasts and osteoblasts within
the periosteum, and some osteocytes, were observed to stain with antibodies 3109 and 3110 in
the calvariae of the HBM transgenic mice compared to wild-type mice. High magnification of
calvaria tissue sections of the HBM transgenic mice showed a pronounced cell membrane
staining of the osteocytes and the cells within the suture area.

Similar antibodies could be prepared to HBM-like proteins and polypeptides, as 
would be readily appreciated by the artisan of ordinary skill. 

20. Methods of Use: Gene Therapy 

In recent years, significant technological advances have been made in the area of gene 
therapy for both genetic and acquired diseases (Kay et al., 1997 Proc. Natl. Acad. Sci. USA,
94: 12744-6). Gene therapy can be characterized as the deliberate transfer of DNA for 
therapeutic purposes. Improvement in gene transfer methods has allowed for development of 
gene therapy protocols for the treatment of diverse types of diseases. Gene therapy has also 
taken advantage of recent advances in the identification of new therapeutic genes, 
improvement in both viral and nonviral gene delivery systems, better understanding of gene 
regulation, and improvement in cell isolation and transplantation. 

The preceding experiments identify the HBM gene as a dominant mutation conferring 
elevated bone mass. Additional HBM-like genes are identifiable based on the model and data 
presented herein regarding the propellers and structure of the HBM protein. The fact that this 
mutation is dominant indicates that expression of the HBM protein causes elevated bone
mass. Older individuals carrying the HBM gene, and therefore expressing the HBM protein,
do not suffer from osteoporosis. These individuals are equivalent to individuals being treated
with the HBM protein. These observations are a strong experimental indication that
therapeutic treatment with the HBM protein prevents osteoporosis. The bone mass elevating
activity of the HBM gene is termed "HBM function" or "HBM like phenotype" in the instance
of other HBM variants.

Therefore, according to the present invention, a method is also provided of supplying
HBM function to mesenchymal stem cells (Onyia et al., 1998 J. Bone Miner. Res. 13: 20-30;
Ko et al., 1996 Cancer Res. 56: 4614-9). Supplying such a function provides protection
against osteoporosis. The HBM gene or a part of the gene or other HBM-like gene may be
introduced into the cell in a vector such that the gene remains extrachromosomal. In such a
situation, the gene will be expressed by the cell from the extrachromosomal location.

Vectors for introduction of genes both for recombination and for extrachromosomal
maintenance are known in the art, and any suitable vector may be used. Methods for
introducing DNA into cells such as electroporation, calcium phosphate co-precipitation, and
viral transduction are known in the art, and the choice of method is within the competence of
one skilled in the art (Robbins, ed., Gene Therapy Protocols, Humana Press, NJ (1997)). Cells
transformed with the HBM gene (or HBM-like gene) can be used as model systems to study
osteoporosis and drug treatments that promote bone growth.

As generally discussed above, the HBM or HBM-like gene or biologically active
fragment thereof, where applicable, may be used in gene therapy methods in order to increase
the amount of the expression products of such genes in mesenchymal stem cells (in all
instances where discussing the HBM gene and its cognate product, the HBM-like gene and
cognate protein are also contemplated). It may be useful also to increase the level of
expression of a given HBM protein, or a fragment thereof, even in those cells in which the
wild type gene is expressed normally. Gene therapy would be carried out according to
generally accepted methods as described by, for example, Friedman, Therapy for Genetic
Diseases, Friedman, Ed., Oxford University Press, pages 105-121 (1991).

A virus or plasmid vector containing a copy of the HBM gene linked to expression
control elements, and capable of replicating inside mesenchymal stem cells, is prepared.
Suitable vectors are known and described, for example, in U.S. Patent No. 5,252,479 and WO
93/07282, the disclosures of which are incorporated by reference herein in their entirety. The
vector is then injected into the patient, either locally into the bone marrow or systemically (in
order to reach any mesenchymal stem cells located at other sites, i.e., in the blood). If the
transfected gene is not permanently incorporated into the genome of each of the targeted
cells, the treatment may have to be repeated periodically.

Gene transfer systems known in the art may be useful in the practice of the gene
therapy methods of the present invention. These include viral and non-viral transfer methods.
A number of viruses have been used as gene transfer vectors, including polyoma, i.e., SV40
(Madzak et al., 1992 J. Gen. Virol. 73: 1533-6), adenovirus (Berkner, 1992 Curr. Top.
Microbiol. Immunol. 158: 39-61; Berkner et al., 1988 BioTechniques 6: 616-629; Gorziglia
et al., 1992 J. Virol. 66: 4407-12; Quantin et al., 1992 Proc. Natl. Acad. Sci. USA 89:
2581-2584; Rosenfeld et al., 1992 Cell 68: 143-155; Wilkinson et al., 1992 Nucl. Acids Res.
20: 2233-39; Stratford-Perricaudet et al., 1990 Hum. Gene Ther. 1: 241-256), vaccinia virus
(Mackett et al., 1992 Biotechnology 24: 495-499), adeno-associated virus (Muzyczka, 1992
Curr. Top. Microbiol. Immunol. 158: 91-123; Ohi et al., 1990 Gene 89: 279-282), herpes
viruses including HSV and EBV (Margolskee, 1992 Curr. Top. Microbiol. Immunol. 158:
67-90; Johnson et al., 1992 J. Virol. 66: 2952-65; Fink et al., 1992 Hum. Gene Ther. 3: 11-9;
Breakfield et al., 1987 Mol. Neurobiol. 1: 337-371; Fresse et al., 1990 Biochem. Pharmacol.
40: 2189-2199), and retroviruses of avian (Brandyopadhyay et al., 1984 Mol. Cell. Biol. 4:
749-754; Petropouplos et al., 1992 J. Virol. 66: 3391-3397), murine (Miller, 1992 Curr. Top.
Microbiol. Immunol. 158: 1-24; Miller et al., 1985 Mol. Cell. Biol. 5: 431-437; Sorge et al.,
1984 Mol. Cell. Biol. 4: 1730-7; Mann et al., 1985 J. Virol. 54: 401-7), and human origin
(Page et al., 1990 J. Virol. 64: 5370-6; Buchschalcher et al., 1992 J. Virol. 66: 2731-9). Most
human gene therapy protocols have been based on disabled murine retroviruses.

Non-viral gene transfer methods known in the art include chemical techniques such as
calcium phosphate coprecipitation (Graham et al., 1973 Virology 52: 456-67; Pellicer et al.,
1980 Science 209: 1414-22), mechanical techniques, for example microinjection (Anderson
et al., 1980 Proc. Natl. Acad. Sci. USA 77: 5399-5403; Gordon et al., 1980 Proc. Natl. Acad.
Sci. USA 77: 7380-4; Brinster et al., 1981 Cell 27: 223-231; Constantini et al., 1981 Nature
294: 92-94), membrane fusion-mediated transfer via liposomes (Felgner et al., 1987 Proc.
Natl. Acad. Sci. USA 84: 7413-7; Wang et al., 1989 Biochemistry 28: 9508-14; Kaneda et al.,
1989 J. Biol. Chem. 264: 12126-9; Stewart et al., 1992 Hum. Gene Ther. 3: 267-275; Nabel et
al., 1990 Science 249: 1285-8; Lim et al., 1992 Circulation 83: 2007-11), and direct DNA
uptake and receptor-mediated DNA transfer (Wolff et al., 1990 Science 247: 1465-8; Wu et
al., 1991 BioTechniques 11: 474-85; Zenke et al., 1990 Proc. Natl. Acad. Sci. USA 87:
3655-9; Wu et al., 1989 J. Biol. Chem. 264: 16985-7; Wolff et al., 1991 BioTechniques 11: 474-45;
Wagner et al., 1990; Wagner et al., 1991 Proc. Natl. Acad. Sci. USA 88: 4255-9; Cotten et
al., 1990 Proc. Natl. Acad. Sci. USA 87: 4033-7; Curiel et al., 1991 Proc. Natl. Acad. Sci.
USA 88: 8850-4; Curiel et al., 1991 Hum. Gene Ther. 3: 147-54). Viral-mediated gene
transfer can be combined with direct in vivo vectors to the mesenchymal stem cells and not
into the surrounding cells (Romano et al., 1998 In Vivo 12: 59-67; Gonez et al., 1998 Hum.
Mol. Genetics 7: 1913-9). Alternatively, the retroviral vector producer cell line can be
injected into the bone marrow (Culver et al., 1992 Science 256: 1550-2). Injection of
producer cells would then provide a continuous source of vector particles. This technique has
been approved for use in humans with inoperable brain tumors.

In an approach which combines biological and physical gene transfer methods, 
plasmid DNA of any size is combined with a polylysine-conjugated antibody specific to the 
adenovirus hexon protein, and the resulting complex is bound to an adenovirus vector. The 
trimolecular complex is then used to infect cells. The adenovirus vector permits efficient 
binding, internalization, and degradation of the endosome before the coupled DNA is 
damaged. 

Liposome/DNA complexes have been shown to be capable of mediating direct in vivo
gene transfer. While in standard liposome preparations the gene transfer process is
non-specific, localized in vivo uptake and expression have been reported in tumor deposits, for
example, following direct in situ administration (Nabel, Hum. Gene Ther., 3:399-410 (1992)).

21. Methods of Use: Transformed Hosts and Transgenic Animals as Research Tools
and for the Development of Pharmaceuticals

Cells and animals that carry the HBM, HBM-like, Zmax1 (LRP5), or LRP6 genes,
used as model systems, are valuable research tools to study and test for substances that have
potential as therapeutic agents (Onyia et al., 1998 J. Bone Miner. Res., 13: 20-30; Broder et
al., 1997 Bone, 21: 225-35). Discussion of one of these is meant to include the others, e.g.,
discussion of HBM is meant to include HBM-like variants, and so forth.

Cells for this purpose are typically cultured mesenchymal stem cells. These may be
isolated from individuals with somatic or germline HBM genes. Alternatively, the cell line
can be engineered to carry the HBM or HBM-like gene, as described above. After a test
substance is applied to the cells, the transformed phenotype of the cell is determined. Any
trait of transformed cells can be assessed, including, for example, formation of bone matrix in
culture (Broder et al., 1997 Bone, 21: 225-235), mechanical properties (Kizer et al., 1997
Proc. Natl. Acad. Sci. USA 94: 1013-8), expression of marker genes and response to
application of putative therapeutic agents.

Transgenic modified animals and cell lines may be used to test therapeutic agents.
Transgenic modifications include, for example, insertion of the Zmax1 (LRP5) gene, LRP6,
HBM gene or HBM-like gene and disrupted homologous genes. Alternatively, the inserted
Zmax1, LRP6, HBM, and/or HBM-like gene of the animals may be disrupted by insertion or
deletion mutation or other genetic alterations using conventional techniques, such as those
described by, for example, Capecchi, 1989 Science 244: 1288; Valancuis et al., 1991 Mol.
Cell. Biol. 11: 1402; Hasty et al., 1991 Nature 350: 243; Shinkai et al., Cell, 68:855 (1992);
Mombaerts et al., Cell, 68:869 (1992); Philpott et al., 1992 Science 256: 1448; Snouwaert et
al., 1992 Science 257: 1083; Donehower et al., 1992 Nature 356: 215. After test substances
have been administered to the animals, the growth of bone must be assessed. If the test
substance modulates (e.g., enhances) the growth of bone, then the test substance is a
candidate therapeutic agent. These animal models provide an extremely important vehicle for
potential therapeutic products.

The present invention also provides animals and cell lines wherein the expression of
endogenous genes is activated, and may be further amplified, which does not require in vitro
manipulation and transfection of exogenous DNA encoding Zmax1 (LRP5) or HBM proteins.
These methods are described for example in PCT Application WO 94/12650 and U.S. Patent
No. 5,968,502, both of which are herein incorporated in their entirety by reference. In
addition, the present invention includes methods wherein endogenous activation or
over-expression is achieved by in situ homologous recombination, non-homologous
recombination, or illegitimate recombination methods. These methods are described for
example in PCT Applications WO 99/15659 and WO 00/49162, both of which are
incorporated herein in their entirety.

21.1 Creating transgenic and gene-targeted animals 

The present invention provides genetically modified animals that recapitulate the
human HBM or a HBM-like phenotype. The approaches taken involve the creation of both
transgenic and gene-targeted animals that have the human G to T nucleotide substitution
incorporated into the genome, express human Zmax1 (LRP5) or express a variant with a bone
mass altering phenotype. These approaches can be used with any gene, such as HBM,
HBM-like, LRP5 and LRP6 systems.

21.1.1 Transgenic mice over-expressing the HBM polymorphism
Plasmid constructs were prepared that utilized either the CMVbActin or type I
collagen promoters to drive expression of the human HBM cDNA. The most
commonly used promoters for mammalian expression are from cytomegalovirus (CMV),
Rous sarcoma virus (RSV), Simian virus 40 (SV40), and EF-1α (human elongation factor
1α-subunit). CMV is derived from the human cytomegalovirus immediate-early viral promoter.
CMV is a stronger promoter in most cell lines than either RSV or SV40. The RSV promoter
is derived from an avian virus and tends to be a strong promoter in avian cell lines. The
SV40 promoter expresses well in cell lines that carry the large T antigen, such as COS-1. In
these cell lines, the plasmid is replicated to higher copy numbers. EF-1α is beginning to be
more widely used for recombinant protein expression. EF-1α is the promoter from the human
elongation factor 1α-subunit, a gene that is highly expressed and conserved in mammalian
cell lines.

The chimeric CMVbActin promoter is a strong promoter that has been shown to
produce ubiquitous gene expression in transgenic mice, including in bone. This promoter was
chosen to drive expression of HBM in a manner consistent with the reported widespread
expression of the endogenous mouse Zmax1 (LRP5) gene. Although the HBM phenotype is
observed in bone, the HBM gene may have direct or indirect effects in other tissues.
Therefore, other strong ubiquitous promoters may be utilized as would be known to those
skilled in the art.

Type I collagen promoters provide tissue-restricted gene expression wherein
expression is primarily limited to bone. Other bone-specific promoters are available that
could result in expression of HBM in bone. For example, promoters associated with
proliferation of osteoblasts, including histone, type I collagen, TGFβ1, MSX2, c-fos/c-jun and
Cbfa1, may be used. Promoters associated with bone matrix maturation including alkaline
phosphatase, MGP, Cbfa1, Fra/Jun and Dlx5 also can be used. Promoters associated with
bone mineralization such as osteocalcin, osteopontin, bone sialoprotein and collagenase also
can be used. The promoter chosen would be determined by, for example, the tissue
expression, the degree of regulatable control and the like as would be known to the skilled
artisan. For example, the type I collagen promoters were chosen to ensure that HBM would
be expressed in bone in a temporal, spatial and bone cell-specific pattern resembling the
endogenous pattern of Zmax1 (LRP5) expression in bone.

21.1.2 Transgenic mice over-expressing the wild-type (LRP5) Zmax1 gene
Plasmid constructs were prepared using the CMVbActin and type I collagen
promoters driving expression of Zmax1 (LRP5). These animals can serve as a control animal
model for the HBM, HBM-like and LRP6 transgenic mice (all of which are contemplated
when discussing LRP5/Zmax1). Additional controls include non-transgenic littermates and
wild-type animals of an identical genetic background. Methods for preparing these animals
would be similar to what is discussed for mice which over-express the HBM polymorphism.

21.1.3 LRP6 gene targeted knock out mice

LRP6 knock-out mice were generated using Omnibank embryonic stem (ES) cells
carrying a gene trap vector which inserted into the first intron of the LRP6 gene. The insert
location was determined to be the LRP6 gene by an Omnibank Sequence Tag (OST)
generated by reverse transcription PCR (RT-PCR) of a fusion transcript comprised of 5' gene
trap vector sequence spliced to the host gene transcript 3' of the insertion site. The gene trap
vector functionally knocks out the mouse LRP6 gene by forced splicing of LRP6 exon 1 to the
IRES-LacZ-PolyA element of the gene trap, preventing transcription of LRP6.

Chimeric mice were generated with ES cells, identified as OST38808, by injection
into C57BL/6 albino host blastocysts which were then transferred to pseudopregnant females
and allowed to develop through birth. Germ line chimeras were backcrossed to 129SvEvBrd
strain mice to maintain the knockout allele of LRP6 on an inbred 129SvEvBrd genetic
background. Germ line transmission of the LRP6-KO allele was identified by PCR
amplification of a gene trap specific sequence. Heterozygous LRP6-KO mating pairs were
used for continued breeding. The genotype of wt and LRP6-KO progeny is determined by tail
DNA PCR.
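
For planning the tail-DNA genotyping of progeny from heterozygous mating pairs, the expected genotype fractions can be enumerated as a single-locus cross. The sketch below assumes simple Mendelian segregation and equal viability of all genotypes, neither of which is stated in the text; the function name `cross` and the allele labels are illustrative.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent1, parent2):
    """Expected genotype fractions at one locus.

    Each parent is a pair of alleles; every allele combination is
    assumed equally likely (Mendelian segregation, equal viability).
    """
    counts = Counter("/".join(sorted(pair)) for pair in product(parent1, parent2))
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


# Heterozygous x heterozygous mating pair, as used for continued breeding.
expected = cross(("wt", "KO"), ("wt", "KO"))
for genotype, fraction in expected.items():
    print(genotype, fraction)
```

Observed litters that depart strongly from these fractions would suggest that the equal-viability assumption does not hold for one of the genotypes.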

Measurements of bone density at 9 weeks of age in female heterozygous knock-out
mice have shown significant (p<0.05) decreases in bone volume, trabecular number, and
trabecular thickness as measured by µCT. These results are consistent with the hypothesis
that LRP6 is also involved in modulating bone density and is a target for development of
therapies and drugs. Accordingly, LRP6 transgenic animals and transgenic animals
expressing bone modulating variants of LRP6 are contemplated within the scope of the
invention.

21.1.4 Gene-targeted mice expressing the HBM polymorphism

A gene-targeting construct was prepared that could be used to create animals
containing a HBM or HBM-like knock-in (KI) allele and a Zmax1 (LRP5) knock-out (KO)
allele. The gene-targeting construct contained the HBM polymorphism in exon 3 and
included a neomycin selection cassette that was linked to a transcriptional stop sequence and
was flanked with loxP sites. The HBM polymorphism in mouse Zmax1 results in a glycine
to valine change in the amino acid sequence at position 170 of the mouse LRP5 homolog
(Genbank Accession No. AF064984). Homologous recombination is used to stably introduce
the construct into the mouse genome. If the transcriptional stop sequence functioned to
completely block transcription of the modified Zmax1 allele, then a functional Zmax1
knock-out allele would be generated. This would facilitate production of a homozygous
knock-out animal for the Zmax1 gene.
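
In the standard genetic code, every glycine codon has the form GGN and every valine codon the form GTN, so a G to T substitution at the second codon position converts glycine to valine whichever glycine codon is present. The short sketch below checks this; it does not assume which codon the mouse or human gene actually uses, and the function name is illustrative.

```python
# Standard genetic code: glycine is GGN, valine is GTN.
GLYCINE_CODONS = {"GGT", "GGC", "GGA", "GGG"}
VALINE_CODONS = {"GTT", "GTC", "GTA", "GTG"}


def g_to_t_second_base(codon):
    """Return the codon with a G to T substitution at its second position."""
    if codon[1] != "G":
        raise ValueError(f"second base of {codon!r} is not G")
    return codon[0] + "T" + codon[2]


mutated = {codon: g_to_t_second_base(codon) for codon in sorted(GLYCINE_CODONS)}
for before, after in mutated.items():
    print(f"{before} (Gly) -> {after} (Val)")
```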

To create the knock-in allele, Cre recombinase could be used to excise the neomycin
selection cassette, leaving behind the modified exon 3 and one copy of the loxP site. Cre
could be introduced into single-cell fertilized embryos to facilitate ubiquitous expression of
HBM, or by crossing animals with transgenic mice to obtain bone-specific HBM expression.
Homozygous animals could be made for the HBM knock-in allele. Alternatively, animals
could be created by nuclear transfer techniques, wherein nuclei from homozygous animals are
transferred into a prepared oocyte (e.g., enucleated) as is known in the art. See, e.g.,
Campbell et al., Nature 380: 64-68 (1996). Additional methods of creating knock out mice
include engineering a homologous recombination vector wherein the ATG start codon is
deleted or mutated, engineering a frame-shift mutation into the vector, engineering deletions
of critical portions of the promoter region, and/or engineering a vector to delete critical
regions of the gene.
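
The Cre-mediated conversion of the targeted allele into the knock-in allele can be pictured as removing everything between two loxP sites while leaving a single loxP behind. The sketch below models an allele as an ordered list of named elements. The element names and their order relative to exon 3 are assumptions made for illustration, and the model assumes the two loxP sites are in the same orientation, which is the arrangement that gives excision.

```python
def cre_excise(elements, site="loxP"):
    """Remove the segment between the first two `site` elements.

    One copy of the site is retained, mirroring excision between two
    directly repeated loxP sites. Alleles with fewer than two sites
    are returned unchanged.
    """
    positions = [i for i, element in enumerate(elements) if element == site]
    if len(positions) < 2:
        return list(elements)
    first, second = positions[0], positions[1]
    return list(elements[:first + 1]) + list(elements[second + 1:])


# Hypothetical layout of the targeted allele before Cre is introduced.
targeted_allele = [
    "5' homology arm",
    "loxP",
    "neomycin cassette",
    "transcriptional stop",
    "loxP",
    "exon 3 (HBM polymorphism)",
    "3' homology arm",
]

knock_in_allele = cre_excise(targeted_allele)
print(knock_in_allele)
```

Before excision the stop sequence is expected to block transcription (the knock-out state described above); after excision only the modified exon 3 and one loxP site remain between the homology arms.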

The methods used for the LRP5, LRP6 and HBM mice can similarly be used for other 
variants of these genes. 

21.2 Materials and Methods 

21.2.1 Construction of the Zmaxl (LRP5) plasmid ZmaxlGI_3AS
The full-length Zmaxl cDNA construct has been engineered into the XbaI-NotI sites
of the pCMVSPORT6.0 vector from Life Technologies (part of the Gateway cloning system)
to create ZmaxlGI_3AS. The insert (5,278 bp) can be released from the vector by digestion
with either HindIII or XbaI on the 5' side together with either EcoRV or EcoRI on the 3' end.
A similar construct can be prepared for LRP6 and HBM-like nucleic acids as would be
apparent to one of ordinary skill and is contemplated as well in the discussion of this section.

The Zmaxl (LRP5) construct was generated from four independent partial clones.
These clones were isolated from a Zmaxl-specific primed cDNA library. A partial Zmaxl
cDNA clone existed in the internal survey sequencing clone set as L236BJP0049E08, isolated
from an oligo-dT primed HeLa cell cDNA library. This clone was truncated at the 5'
end. In order to isolate more 5'-containing fragments necessary to generate a full-length
cDNA, a Zmaxl gene-specific cDNA library was generated from Clontech human liver
poly-A mRNA (catalog no. 6510-1, lot no. 9060032) using the Life Technologies Superscript®
Plasmid System for cDNA Synthesis and Plasmid Cloning kit (catalog no. 18248-013). This
library was designated L401. The manufacturer's protocol was carried out with the following
modifications. 1) In both first and second strand synthesis reactions, DEPC-treated water was
substituted for α-32P-dCTP. 2) Reverse transcription was primed using oligonucleotides that
were selected to be specific for the Zmaxl gene at approximately 1 kb intervals. These
sequences were checked using the program BLAST against the public databases to ensure
Zmaxl specificity. 3) Two separate reverse transcription reactions were performed. The first
reaction, (A), was primed with oligonucleotides which annealed to the more 3' regions of
Zmaxl (LRP5) as follows:

47114: 5'-CGTACGTAAAGCGGCCGCTTGGCAATACAGATGTGGGA-3' (SEQ ID NO: 715)
47116: 5'-CGTACGTAAAGCGGCCGCAGTAGCTCCTCTCGGTGGC-3' (SEQ ID NO: 716)
47118: 5'-CGTACGTAAAGCGGCCGCGCTCATCATGGACTTTCCG-3' (SEQ ID NO: 717)
47120: 5'-CGTACGTAAAGCGGCCGCGCACTGCTGTTTGATGAGG-3' (SEQ ID NO: 718)

The second reaction, (B), used the previously mentioned four oligonucleotides, as well as

47108: 5'-CGTACGTAAAGCGGCCGCGAGTGTGGAAGAAAGGCTGC-3' (SEQ ID NO: 719)
47110: 5'-CGTACGTAAAGCGGCCGCAGTAGAGCTTCCCCTCCTGC-3' (SEQ ID NO: 720)
47112: 5'-CGTACGTAAAGCGGCCGCGTCCATCACGAAGTCCAGGT-3' (SEQ ID NO: 721)
All oligonucleotides contained a NotI linker sequence and were used at a concentration of
0.02 ug/ul. 4) The SalI-adapted cDNA from both reverse transcription reactions was
size-fractionated by electrophoresis on 1% agarose, 0.1 ug/ml ethidium bromide, 1x TAE gels.
The ethidium bromide-stained cDNA between 0.6 and 8.0 kb was excised from the gel. The
cDNA was purified from the agarose gel by electroelution (ISCO Little Blue Tank
Electroelutor®) using the manufacturer's protocol. The purified cDNA from reactions A and
B was then pooled. 5) The size-fractionated SalI-adapted cDNA was ligated to
NotI-SalI digested pBluescript® (Stratagene, La Jolla, CA).
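
As an illustrative check of the primer design described above, the following sketch confirms that each reverse-transcription primer carries a NotI recognition sequence (GCGGCCGC) and separates the shared linker from the gene-specific 3' portion. The primer sequences are those listed above; the helper name `split_at_noti` is hypothetical.

```python
NOTI_SITE = "GCGGCCGC"  # NotI recognition sequence

# Reverse-transcription primers: reaction (A) uses the first four,
# reaction (B) uses all seven.
RT_PRIMERS = {
    "47114": "CGTACGTAAAGCGGCCGCTTGGCAATACAGATGTGGGA",
    "47116": "CGTACGTAAAGCGGCCGCAGTAGCTCCTCTCGGTGGC",
    "47118": "CGTACGTAAAGCGGCCGCGCTCATCATGGACTTTCCG",
    "47120": "CGTACGTAAAGCGGCCGCGCACTGCTGTTTGATGAGG",
    "47108": "CGTACGTAAAGCGGCCGCGAGTGTGGAAGAAAGGCTGC",
    "47110": "CGTACGTAAAGCGGCCGCAGTAGAGCTTCCCCTCCTGC",
    "47112": "CGTACGTAAAGCGGCCGCGTCCATCACGAAGTCCAGGT",
}


def split_at_noti(primer, site=NOTI_SITE):
    """Return (linker including the NotI site, gene-specific part), or None."""
    idx = primer.find(site)
    if idx < 0:
        return None
    cut = idx + len(site)
    return primer[:cut], primer[cut:]


for name, seq in RT_PRIMERS.items():
    linker, gene_specific = split_at_noti(seq)
    print(name, linker, gene_specific, len(gene_specific))
```

All seven primers share the same 18-nucleotide linker, so only the 3' portion determines where each primer anneals along the Zmaxl transcript.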

Ligated library cDNA (3 ul) was used to transform electrocompetent E. coli cells
(ElectroMAX® DH10B cells and protocol, Life Technologies catalog no. 18290-015; BioRad
E. coli Pulser, voltage 1.8 kV, 3-5 msec pulse). The transformed cells were plated on
LB-ampicillin (100 ug/ml) agar plates and incubated overnight at 37°C. Approximately 10^6
colony forming units (cfu) were plated at a density of 50,000 cfu/150 mm plate. Cells were
washed off the plates with LB media and collected by centrifugation. Plasmid DNA was
purified from the cells using the QIAGEN Plasmid Giga Kit and protocol (catalog no. 12191)
at a final concentration of 2.05 ug/ul.

Two probes for use in library screening were generated by the polymerase chain
reaction (PCR) using 100 ng of library L401 as template. Standard PCR techniques were
used. A reaction mixture contained 10 pmol of each oligo primer; 0.2 mM each dATP,
dTTP, dCTP and dGTP (PE Applied Biosystems catalog no. N808-0260); 1.5 units Expand™
High Fidelity Taq DNA polymerase and 1x PCR reaction buffer (Roche Molecular
Biochemicals, catalog no. 732-641; 10 mM Tris-HCl, 1.5 mM MgCl2, 50 mM KCl, pH 8.3).
The mixture was incubated at 99°C for one minute, followed by 30 cycles of 96°C for 15
seconds, 50°C for 30 seconds, 72°C for 1 minute with a final incubation at 72°C for 7
minutes (MJ Research DNA Engine® Tetrad PTC-225). The first probe was generated using oligos

107335: 5'-CAGCGGCCTGGAGGATGC-3' (SEQ ID NO: 722)
107338: 5'-CGGTCCAGTAGAGGTTTCG-3' (SEQ ID NO: 723)
which amplify a NotI-SalI fragment of the Zmaxl gene. The second was generated using
oligos

107341: 5'-CATCAGCCGCGCCTTCATG-3' (SEQ ID NO: 724)
107342: 5'-CCTGCATGTTGGTGAAGTAC-3' (SEQ ID NO: 725)
which amplify a SacI-KpnI fragment of the Zmaxl gene. Both PCR products were purified
using the Qiaquick® kit and the manufacturer's protocol (Qiagen catalog no. 28106), then
subcloned into the vector pCRII-TOPO® (Invitrogen catalog no. K4600) following the
manufacturer's protocol. Positive subclones were identified by restriction digestion of
purified plasmid DNA (using standard molecular biology techniques) and subsequent DNA
sequence analysis (ABI Prism BigDye Terminator Cycle® sequencing, catalog no. 4303154,
ABI 377 instruments). Probe DNA was isolated by EcoRI restriction digestion (New
England Biolabs, catalog no. R0101L) of the respective sequence-verified pCRII-TOPO®
clone. Restriction fragments were size-fractionated by gel electrophoresis on 1% agarose, 0.1
ug/ml ethidium bromide, 1x TAE gels. Insert DNA was excised from the gel and purified
using the Clontech NucleoSpin® Nucleic Acid Purification Kit (catalog no. K3051-2)
following the manufacturer's protocol. The purified DNA fragment (25-50 ng) was labeled
with Redivue® [α-32P]-dCTP (Amersham Pharmacia, catalog no. AA0005) using the Prime-It
II® Random Primer Labeling Kit and protocol (Stratagene, catalog no. 300385).
Unincorporated dCTP was removed with Amersham's NICK® column and protocol (catalog
no. 17-0855-02).

Two rounds of screening library L401 were initiated to isolate fragments of the Zmaxl
(LRP5) gene. In the first, forty-three 150 mm LB-100 ug/ml ampicillin agar plates were
plated with primary transformants from L401 at a density of about 3,000-4,000 colonies per
plate. This library was screened using the 32P-labeled probe (NotI-SalI fragment) as described
above at 500,000-1,000,000 cpm/ml hybridization buffer, using standard molecular biology
protocols. From this primary screen, 13 single colonies were identified based on positive
hybridization to the Zmaxl probe. Plasmid DNA, prepared using the QIAprep Spin Miniprep
Kit and protocol (Qiagen Inc., catalog no. 27106), was analyzed by restriction digestion and
sequence analysis as described above. cDNA clone #44 was isolated from this screen and
sequence verified to contain a partial Zmaxl clone.

In the second library screen, one hundred and four 150 mm LB-100 ug/ml ampicillin
agar plates were plated with primary transformants from L401 at a density of 3,000-4,000
colonies per plate. This library was screened using the 32P-labeled probe (SacI-KpnI) exactly
as previously described. From this primary screen, 48 colonies were identified based on
positive hybridization to the Zmaxl (LRP5) probe. Since these colonies were not single
colony isolates, a secondary screen was initiated where each of the 48 primary isolates was
plated at a density of approximately 500 colonies per plate. These colonies were then
screened exactly as the primaries using the labeled SacI-KpnI fragment as probe. Thirty-four
of the 48 primary clones resulted in positive hybridization to the Zmaxl probe and were
isolated as single colonies. Plasmid DNA was prepared and analyzed as described above.
cDNA isolate #71_2 was isolated from this screen and sequence verified to contain a partial
Zmaxl clone.
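
The scale of the plating and screening steps above follows from simple arithmetic, sketched here for orientation. The figures are the plate counts and densities stated in the text; the function names are hypothetical.

```python
def plates_needed(total_cfu, cfu_per_plate):
    """Number of plates required to hold total_cfu at the given density."""
    return -(-total_cfu // cfu_per_plate)  # ceiling division


def colonies_screened(n_plates, low_per_plate, high_per_plate):
    """Range of colonies covered by n_plates at the stated density range."""
    return n_plates * low_per_plate, n_plates * high_per_plate


amplification_plates = plates_needed(10**6, 50_000)
first_screen = colonies_screened(43, 3_000, 4_000)
second_screen = colonies_screened(104, 3_000, 4_000)
print(amplification_plates, first_screen, second_screen)
```

Under these numbers the library amplification corresponds to 20 plates, the first screen covers roughly 129,000 to 172,000 colonies, and the second covers roughly 312,000 to 416,000.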

In all cases, the sequence of any Zmaxl (LRP5) isolate was compared to a reference 
sequence (i.e., the sequence of the wild-type Zmaxl allele from an affected member of the 
HBM kindred). This analysis was important since DNA polymorphisms had been reported 
for this gene in the literature. This reference sequence is predicted to encode a polypeptide of 
Genbank Accession No. AF077820. 

The four independent partial clones used to prepare ZmaxlGI_3AS are as follows:
1). Bases 1-1366: An XbaI-SalI fragment was obtained from a Zmaxl cDNA
construct, GTC.Zmaxl_13. GTC.Zmaxl_13 contains a 5075 bp insert containing the
entire ORF of LRP5. The clone was blunt-end cloned in the EcoRV site of pSTBlue-1.
This clone was generated by fusing a 5' clone derived from screening a bone
random primed cDNA library in a pBluescript™ II derivative with a 3' clone derived
from a PCR product from a bone dT primed cDNA library in pBluescript™ II. PCR
was performed using LRP5 specific forward primer 5'-GCCCGAAACCTCTACTGG
ACCGAC-3' (SEQ ID NO: 726) and reverse primer 5'-GCCCACCCCATCACAGTTCA
CATT-3' (SEQ ID NO: 727) using DNAzyme polymerase. The resultant 3.7 kb PCR
product was cloned into PCR-XL-TOPO. To generate the full length clone the 5' and
3' plasmids were transformed into DM1 (dam-) from Gibco/BRL. The 5' plasmid was
digested with XbaI and the 3' plasmid was digested with HindIII. The digested
plasmids were filled in with T4 polymerase to generate blunt ends and cut with BclI.
The 1.7 kb 5' fragment and 3.5 kb 3' fragment were gel purified, ligated together, and
cloned in the EcoRV site of pSTBlue-1. It provides a short 5' UTR, with coding
sequence beginning at base 100. Furthermore, it carries some additional restriction
sites at the 5' multiple cloning site. This fragment also contains a DNA
polymorphism relative to the Genbank Accession No. AF077820 sequence at position
558 resulting in an A (AF077820) to a G change; this mutation does not result in an
amino acid difference (Pro).

2). Bases 1367-2403: This clone was obtained from a Zmaxl gene-primed cDNA
library made from commercial human liver RNA described above. This fragment is a
SalI-BamHI piece of DNA obtained from isolate #44. The sequence is identical to
Genbank Accession No. AF077820.

3). Bases 2404-4013: This BamHI-BssHII fragment was obtained from isolate
#71_2 from the same library as described above. At position 3456, there is a DNA
polymorphism resulting in a G (AF077820) to an A. This nucleotide difference does
not change the encoded amino acid (Val).

4). Bases 4014-5278: This BssHII-NotI fragment came from an internal clone,
L236BJP0049E08. It was obtained from an oligo-dT primed HeLa cell cDNA
library. The stop codon occurs at base 4947. The clone contains 331 bp of 3' UTR
sequence, including a 120 bp poly-A tail followed by the NotI site. The 3' NotI site
used in this subcloning step is a result of an added linker that was introduced at the
end of the poly-A tail during library construction. A DNA polymorphism is present at
base 4515 resulting in a G (AF077820) to C change that is silent at the amino acid
level (Leu).

To generate the 5' section of the Zmaxl (LRP5) gene, the XbaI-SalI fragment and
SalI-BamHI fragment were ligated into the XbaI-BamHI sites of pBluescript® (Stratagene).
The 3' section of the gene was obtained by ligating the 1.61 kb BamHI-BssHII fragment from
Zmaxl isolate #71_2 to the 1.26 kb BssHII-NotI fragment from L236BJP0049E08. These
two fragments were ligated into the BamHI-NotI sites of pBluescript®. The full length Zmaxl
cDNA was engineered into the XbaI-NotI sites of the vector pCMVSPORT6.0 (Life
Technologies) by ligation of the XbaI-BamHI 5' section and the BamHI-NotI 3' section. The
resulting plasmid, ZmaxlGI_3AS, contains an insert of 5278 bp from the XbaI site to the NotI
site of the vector's multiple cloning site. This clone is in the antisense orientation with
respect to the CMV promoter present in the vector. Zmaxl coding sequence begins at base
100 and ends at base 4947, followed by 331 bp of 3' UTR sequence including a 120
bp poly-A tail. This full length cDNA contains three DNA polymorphisms from the
reference sequence (GenBank Accession No. AF077820) that do not alter the predicted
amino acid sequence. These polymorphisms are at position 558 resulting in an A to G that
maintains the proline residue; at position 3456 resulting in a G to A that maintains the valine
residue; and, at position 4515 resulting in a G to C that maintains the leucine residue.
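
The base ranges given for the four partial clones can be checked for internal consistency with a short sketch: the fragments should tile the insert without gaps or overlaps, their sizes should match the quoted fragment lengths, and the distance from the end of the coding sequence to the end of the insert should equal the stated 3' UTR length. All numbers come from the text above; the function name is hypothetical.

```python
# (name, first base, last base) for the four partial clones, 1-based inclusive.
FRAGMENTS = [
    ("XbaI-SalI", 1, 1366),
    ("SalI-BamHI", 1367, 2403),
    ("BamHI-BssHII", 2404, 4013),
    ("BssHII-NotI", 4014, 5278),
]
INSERT_LENGTH = 5278
CDS_END = 4947


def check_tiling(fragments, total_length):
    """Verify contiguous coverage of 1..total_length and return fragment sizes."""
    expected_start = 1
    for name, start, end in fragments:
        if start != expected_start:
            raise ValueError(f"gap or overlap before {name}")
        expected_start = end + 1
    if expected_start - 1 != total_length:
        raise ValueError("fragments do not cover the full insert")
    return {name: end - start + 1 for name, start, end in fragments}


sizes = check_tiling(FRAGMENTS, INSERT_LENGTH)
utr3_length = INSERT_LENGTH - CDS_END
print(sizes, utr3_length)
```

The computed sizes (1366, 1037, 1610 and 1265 bp) agree with the 1.61 kb and 1.26 kb fragments named in the assembly, and the 3' remainder agrees with the stated 331 bp.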

The sequence of ZmaxlGI_3AS (Figure 25) also contains a DNA polymorphism
relative to SEQ ID NO: 1 at base 4088 resulting in a C (SEQ ID NO: 1 and Genbank
Accession No. AB017498) to T change that results in an amino acid change at position 1330
of alanine to valine. This is consistent with the sequence determined in the wild-type allele
from an affected member of the HBM kindred as well as with the published sequence of
Genbank Accession No. AF077820. ZmaxlGI_3AS also has 29 additional bases at the 5' end
relative to SEQ ID NO: 1, as well as 129 bases at the 3' end consisting of an extra G, 120
bases of poly-A tract, and the NotI site.
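
The relationship between the nucleotide positions quoted in this section and the affected codons can be worked out from the stated start of the coding sequence (base 100). The sketch below is a plain arithmetic check, assuming base 4947 is the last base of the stop codon; position 611 is the HBM mutation site described in section 21.2.2.

```python
CDS_START = 100  # first base of the coding sequence in the insert
CDS_END = 4947   # assumed to be the last base of the stop codon


def codon_position(base, cds_start=CDS_START):
    """Map an insert base number to (codon number, position within codon), 1-based."""
    offset = base - cds_start
    return offset // 3 + 1, offset % 3 + 1


positions = {base: codon_position(base) for base in (558, 611, 3456, 4088, 4515)}
codon_count = (CDS_END - CDS_START + 1) // 3  # includes the stop codon
print(positions, codon_count)
```

The three silent polymorphisms (558, 3456, 4515) fall on third codon positions, while 4088 and 611 fall on the second base of codons 1330 and 171, consistent with the alanine-to-valine and glycine-to-valine changes described in the text.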

21.2.2 Creation of the HBM mutation G171V

The HBM mutation that results in a predicted amino acid change from glycine to
valine at amino acid 171 was introduced into the full length human Zmaxl (LRP5) cDNA
(plasmid ZmaxlGI_3AS) using PCR to change the G at position 611 to a T. Introduction of
the HBM mutation was done using oligos 107335 (5'-CAGCGGCCTGGAGGATGC-3';
SEQ ID NO: 728) and 49513 (5'-CGGGTACATGTACTGGACAGCTGATTAGC-3'; SEQ
ID NO: 729), which flank the endogenous NotI site of the Zmaxl (LRP5) gene. This method
creates a new PvuII site at the 3' end of the PCR product. A second PCR reaction was
completed using oligos which introduce a ScaI site at the 5' end of the product, which
contains the endogenous SalI site of Zmaxl at its 3' end. PCR products were purified using
the QiaQuick® procedure (Qiagen Inc.) and subcloned into the vector pCRII-TOPO (Invitrogen)
as described above. Plasmid DNA was purified from single bacterial colonies and analyzed
by restriction digest and subsequent sequence analysis, all as described above. The
sequence-verified pCRII-TOPO clones were restriction digested with NotI-PvuII and ScaI-SalI,
respectively. The resulting DNA fragments were size fractionated and purified as described
above. These two fragments were then subcloned into the vector pBluescript® that had been
prepared by NotI-SalI digestion. Both PvuII and ScaI produce blunt ends when used to digest
double stranded nucleic acids. Thus, the resulting ligated fragment fails to recreate either the
PvuII or ScaI site and contains only the consensus Zmaxl sequence, with the exception of the
newly introduced HBM mutation. To introduce the mutation into the full length Zmaxl gene,
this resulting plasmid was digested with MscI and SalI, while the 5' region of Zmaxl was
obtained by XbaI-MscI digestion of Zmaxl plasmid GTC.Zmaxl_13. These two fragments
were ligated together into XbaI-SalI digested pBluescript®, in effect creating a similar 1.366
kb XbaI-SalI fragment. The only difference is that this construct contains the HBM
mutation described above. The full length HBM cDNA then was assembled into
pCMVSPORT6.0 exactly as described above for the Zmaxl gene, with the substitution of this
newly created XbaI-SalI fragment containing the HBM mutation. The entire cDNA insert
was verified by DNA sequence analysis and the introduction of the HBM mutation was
confirmed.
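
The statement that the blunt PvuII/ScaI junction recreates neither site can be illustrated with a minimal sketch. PvuII cuts CAG^CTG and ScaI cuts AGT^ACT, both leaving blunt ends; the flanking bases in the two example products are hypothetical placeholders, not the actual Zmaxl sequence.

```python
# (recognition site, cut offset within the site); both enzymes cut bluntly.
PVUII = ("CAGCTG", 3)  # CAG^CTG
SCAI = ("AGTACT", 3)   # AGT^ACT


def left_of_cut(seq, enzyme):
    """Sequence upstream of the cut at the first occurrence of the site."""
    site, offset = enzyme
    return seq[: seq.index(site) + offset]


def right_of_cut(seq, enzyme):
    """Sequence downstream of the cut at the first occurrence of the site."""
    site, offset = enzyme
    return seq[seq.index(site) + offset:]


# Hypothetical PCR products: a PvuII site near the 3' end of the first,
# a ScaI site near the 5' end of the second.
product_1 = "GGATCCAAACAGCTGTTT"
product_2 = "TTTAGTACTGGGTCGAC"

junction = left_of_cut(product_1, PVUII) + right_of_cut(product_2, SCAI)
print(junction)
```

The ligated junction reads CAG followed by ACT, which matches neither recognition sequence, so neither enzyme can cut the joined fragment at that position.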

The resulting plasmid, HBMGI_2AS (Fig. 24), contains an insert of 5,278 bp from the
XbaI site to the NotI site of the vector's multiple cloning site. This clone is in the antisense
orientation with respect to the CMV promoter in the vector. HBM coding sequence begins at
base 100 and ends at base 4947, followed by 331 bp of 3' UTR sequence which
includes a 120 bp poly-A tail. This full length cDNA contains three DNA polymorphisms
from the reference sequence, which do not alter the amino acid sequence. These
polymorphisms occur at position 558, resulting in an A to G change that maintains the
proline residue; at position 3456, resulting in a G to A change that maintains the valine
residue; and at position 4515, resulting in a G to C change that maintains the leucine residue.
Additionally, the HBM mutation is present at position 611 (G in Zmaxl to T in HBM), which
results in a predicted amino acid change of glycine to valine at amino acid position 171, as
found in affected members of the HBM kindred. This insert sequence was used to generate
the construct used for HBM over-expressing transgenic mice.

21.3 Transgene preparation 

The examples provided herein are illustrations of how transgenic animals can be 
prepared. Additional transgenic animals can be prepared as would be known in the art. See, 
for example, Glenn Monastersky et al., ed., Strategies in Transgenic Animal Studies (Amer.
Soc. Microbiology 1995) and the references cited therein. 

21.3.1 CMVβActin promoter-HBM cDNA (HBMMCBA)

To prepare the CMVβActin-HBM construct, pCX-EGFP, a plasmid containing the
chimeric CMVβActin promoter, was purified as a 4778 bp EcoRI fragment. Subsequently,
the HBM cDNA was excised from HBMGI_2AS as a 4994 bp XbaI/DraI fragment, treated
with Klenow fragment of DNA polymerase, ligated to EcoRI linkers, and digested with
EcoRI. This fragment was then inserted into the EcoRI site of pCX-EGFP. A SpeI/HindIII
7265 bp CMVβActin-HBM fragment was purified for microinjection into mouse embryos.

21.3.2 Type I collagen promoter-HBM cDNA (HBMMTIC)

The rat type I collagen promoter-HBM construct was created by first replacing the
pBS(SK-) (Stratagene) polylinker with another polylinker (i.e., comprising KpnI - SpeI -
HindIII - BglII - NdeI - SalI - SmaI - EcoRI - PstI - BamHI - XbaI - ScaI - NcoI - ClaI -
NotI - SacII - SacI), that is referred to as BS(SK-)A/D. The SV40 splice and poly(A)n XbaI-
NcoI region (750 bp) from pcDNA I (Invitrogen, Inc.) was directionally cloned into
BS(SK-)A/D. Next, a 4994 bp EcoRI HBM cDNA fragment (above) was cloned into the EcoRI site.
A 3640 bp XbaI type I collagen promoter fragment was subcloned into the XbaI site of
BS(SK-) (Stratagene). The promoter fragment was then excised from BS(SK-) with SacII,
blunt-ended with T4 DNA polymerase, digested with SpeI, and ligated into the HBM
BS(SK-)A/D construct, which was digested with NdeI, blunted with T4 DNA polymerase, and
digested with SpeI. A SpeI/ClaI 9435 bp type I collagen-HBM fragment was purified for
microinjection into mouse embryos.

21.3.3 CMVβActin promoter-Zmaxl cDNA (ZmaxlWTCBA)

The CMVβActin promoter-HBM cDNA construct from above was used to generate
the final plasmid. The following three fragments were ligated together: 1) a 6.34 kb XbaI-
KpnI backbone fragment from HBMMCBA; 2) a 0.64 kb XbaI-SapI fragment from
HBMMCBA containing the 3' end of the βActin promoter and the 5' end of the HBM cDNA;
and 3) a 2.8 kb SapI-KpnI fragment derived from the Zmaxl cDNA that contains the wild-
type sequence. A 7.2 kb SpeI - HindIII CMVβActin - Zmaxl fragment was purified for
microinjection into mouse embryos.

21.3.4 Type I collagen promoter-Zmaxl cDNA (ZmaxlWTTIC)

The type I collagen - HBM cDNA construct from above was used to generate the
final plasmid. The HBMMTIC plasmid was linearized with HindIII and cut with either SalI
to yield an 8.52 kb HindIII - SalI fragment or SapI to yield a 2.98 kb SapI - HindIII fragment.
A 2.8 kb SapI - SalI fragment from the Zmaxl (LRP5) cDNA containing the wild-type
sequence was then ligated to the above two fragments to yield the final plasmid. A 9.4 kb
SpeI - ClaI type I collagen - Zmaxl fragment was purified for microinjection into mouse
embryos.

21.4 Confirmation of transgene expression in vitro

Plasmid constructs for HBMMCBA, HBMMTIC, ZmaxlWTCBA and ZmaxlWTTIC
were transiently transfected into human osteoblast (HOB) cells to measure mRNA expression
as a test for functionality.

21.4.1 Transient Transfections 

HOB-02-02 cells are a clonal, post-senescent cell line derived from the HOB-02-C1
cells (Bodine et al., 1996, J. Bone Miner. Res. 11: 806-819). Like the parental cell line, the
HOB-02-02 cells express the temperature-sensitive SV40 large T-antigen mutant, tsA209.
Consequently, these cells proliferate at the permissive temperature of 34°C, but stop dividing
at non-permissive temperatures of 37°C or above. Also like the parental cell line, the
HOB-02-02 cells are cultured with Growth Medium (D-MEM/F-12 containing 10% heat-
inactivated fetal bovine serum, 1% penicillin-streptomycin and 2 mM GlutaMAX-1) at 34°C
in a 5% CO2/95% humidified air incubator (Forma Scientific, Marietta, OH).

For the transient transfections, the HOB-02-02 cells were seeded with Growth
Medium at 400,000 cells/well into 6-well plates and incubated overnight at 34°C. The cells
were transfected with 0.3 ug/well of either the CMVβActin-HBM expression plasmid, the
Type I Collagen-HBM expression plasmid or the corresponding empty vectors using
LipofectAMINE 2000 transfection reagent according to the manufacturer's instructions (Life
Technologies, Rockville, MD). After a 24 hr incubation at 34°C, the medium was changed,
and the cells were incubated for an additional 24 hr at 39°C. At the end of this last
incubation, the cells were rinsed with Hank's buffered salt solution. Total cellular RNA was
then isolated using TRIzol® according to the manufacturer's instructions (GibcoBRL, Grand
Island, NY). The RNA was treated with RNase-free DNase in order to remove contaminating
DNA as previously described (Bodine et al., 1997, J. Cell Biochem. 65: 368-387).

21.4.2 TaqMan® assay for mRNA expression

TaqMan® primers and probes were chosen based on human and mouse Zmaxl 
(LRP5) cDNA sequences. The selected sequences were designed to be gene-specific by 
analysis of an alignment of human and mouse Zmaxl (LRP5) sequences as illustrated in Fig. 
26. 

TaqMan® quantitative reverse transcriptase-polymerase chain reaction (RT-PCR) 
analysis of RNA isolated from human cells was performed as described by the manufacturer 
(PE Applied Biosystems, Foster City, CA) using the following primers and probe set: 
Human Zmaxl-l/HBM-1: 

Forward Primer: 5'-GTCAGCCTGGAGGAGTTCTCA-3' (SEQ ID NO: 730)
Reverse Primer: 5'-TCACCCTTGGCAATACAGATGT-3' (SEQ ID NO: 731)
Probe: 6-FAM-5'-CCCACCCATGTGCCCGTGACA-3' (SEQ ID NO: 732)

Results from the experimental primers/probe set were normalized to human GAPDH
levels using the multiplex protocol with the human GAPDH control kit from PE Applied
Biosystems. Species-specific TaqMan® quantitative RT-PCR analysis of RNA isolated from
murine cells and tissues was performed as described by the manufacturer (PE Applied
Biosystems, Foster City, CA) using the following primers and probe sets:
Human Zmax-l/HBM-1:

Forward Primer: 5'-CGTGATTGCCGACGATCTC-3' (SEQ ID NO: 733)
Reverse Primer: 5'-TTCCGGCCGCTAGTCTTGT-3' (SEQ ID NO: 734)
Probe: 6-FAM-5'-CGCACCCGTTCGGTCTGACGCAGTAC-3' (SEQ ID NO: 735)

Mouse Zmax-l/HBM-1:

Forward Primer: 5'-CTTTCCCCACGAGTATGTTGGT-3' (SEQ ID NO: 736)
Reverse Primer: 5'-AAGGGACCGTGCTGTGAGC-3' (SEQ ID NO: 737)
Probe: 6-FAM-5'-AGCCCCTCATGTGCCTCTCAACTTCATAG-3' (SEQ ID NO: 738)

Results from the experimental primers/probe sets were normalized to 18S ribosomal
RNA levels using the multiplex protocol with the 18S ribosomal RNA control kit from PE
Applied Biosystems. A summary of these results is presented in Figures 17-20.
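
The text states only that results were normalized to GAPDH or 18S ribosomal RNA; it does not say how the normalization was computed. One common approach for multiplex TaqMan data is the comparative threshold-cycle calculation, sketched below as an assumption rather than as the method actually used. The Ct values are made up for illustration and are not data from this study.

```python
def relative_expression(ct_target, ct_reference):
    """Target signal relative to the reference measured in the same well: 2^-(dCt)."""
    return 2.0 ** -(ct_target - ct_reference)


# Hypothetical Ct values, for illustration only.
transfected = relative_expression(ct_target=24.0, ct_reference=18.0)
vector_only = relative_expression(ct_target=29.0, ct_reference=18.0)
fold_change = transfected / vector_only
print(fold_change)
```

Because the reference is measured in the same well as the target, differences in RNA input between samples cancel out of the ratio.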

21.5 Production of transgenic mice 

21.5.1 DNA microinjection

Transgene fragments for microinjection were first purified on 1% agarose gels
according to the GELase protocol from Epicentre Technologies. Fragments were then further
purified on cesium chloride density gradients and extensively dialyzed against 5 mM Tris
(pH 7.4) and 0.1 mM EDTA.

Linearized DNA was microinjected into mouse embryos according to standard
procedures. DNA was injected into primarily the male pronucleus of fertilized C57BL/6T
mouse embryos. Injected embryos (n=20-35) were transferred to the oviduct (unilaterally) of
day 0.5 post coitum pseudopregnant Swiss Webster embryo recipients. Offspring were tail-
biopsied and genotyped at age 10-14 days.

21.6 Production of gene-targeted transgenic mice

21.6.1 Gene targeting vectors and probes

Two gene-targeting vectors were constructed for modification of the Zmaxl (LRP5)
gene in embryonic stem (ES) cells. The two constructs, illustrated in Figure 16 and designated
Zmaxl-KI/KO A and B, were designed to generate two types of mutations: a knock-out (KO) of
the Zmaxl gene and a Cre recombinase-dependent knock-in (KI) of a nucleotide substitution
in order to create a mouse model (i.e., a glycine 170 to valine amino acid substitution in mouse
Zmaxl) of the HBM kindred.

Both gene-targeting vectors were constructed using genomic DNA from the mouse
BAC clone 473P5 (Genbank Accession No. AZ095413) containing the first
five exons of the mouse Zmaxl gene. This clone was isolated by Research Genetics
(Huntsville, AL) from their mouse 129SvJ genomic BAC library using a polymerase chain
reaction (PCR) screen for exon 3. Forward and reverse primers of

Forward: 5'-GAGCGGGCAGGGATGGATGG-3' (SEQ ID NO: 739)
Reverse: 5'-AGGTTGGCACGGTGGATGAAGC-3' (SEQ ID NO: 740)
were used to amplify exon 3 by PCR; the following thermal cycling conditions were
employed: for thirty cycles, 95°C for 0.5 minute, 55°C for 1 minute and 72°C for 1 minute.
Identity of this clone with mouse Zmaxl was confirmed by sequencing exon 3 using the BAC
clone DNA as template. PCR products were cloned using the pGEM-T-easy T/A cloning kit.
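
For planning purposes, the programmed hold time of a thermal cycling protocol can be computed directly from its steps. The sketch below does this for the exon 3 screen above and for the library-probe PCR in section 21.2.1; it counts only the programmed holds and ignores ramp times, which depend on the instrument. The function name is hypothetical.

```python
def program_seconds(cycles, step_seconds, initial=0, final=0):
    """Total programmed hold time: initial hold + cycles * steps + final hold."""
    return initial + cycles * sum(step_seconds) + final


# Exon 3 screen: 30 cycles of 95 C 0.5 min, 55 C 1 min, 72 C 1 min.
exon3_screen = program_seconds(30, [30, 60, 60])

# Library-probe PCR: 99 C 1 min; 30 cycles of 96 C 15 s, 50 C 30 s, 72 C 1 min;
# final 72 C 7 min.
probe_pcr = program_seconds(30, [15, 30, 60], initial=60, final=420)

print(exon3_screen / 60, probe_pcr / 60)  # minutes
```

The same 95°C/55°C/72°C program is reused for the mutagenesis and probe amplifications later in this section, so the first figure applies to those reactions as well.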

21.6.2 Zmaxl (LRP5) knock-in/knock-out vector

The organization of the genomic BAC clone 473P5 was characterized by Southern
blot analysis using subcloned exon 1, exon 2 and exon 3 as probes and by sequencing the
region spanning exon 1 through exon 5. Two different constructs were prepared for the
Zmaxl KI/KO targeting. These constructs (A and B) differ only in flanking arms of
homology. Construct (A) contains a 6.5 kb BstEII-XbaI 5' arm of homology and a 1 kb
XbaI-EcoRI 3' arm of homology, whereas construct (B) contains a 1 kb 5' arm of homology
and a 6.0 kb 3' arm of homology. The constructs were prepared by ligating short and long
arms of homology to a loxP-flanked cassette containing the neomycin resistance gene
(MC1-Neo, Stratagene) and a synthetic transcriptional pause sequence (Promega).

Both Zmaxl-KI/KO targeting vectors (A and B) were modified to contain a G-to-T
nucleotide substitution, encoding the G170V amino acid substitution, in exon 3. These
modifications were introduced by overlapping PCR mutagenesis using the wild type
sequence of the short arm of homology as template. In addition, the 1 kb short arm of the
Zmaxl-KI/KO (B) targeting vector was modified to include a 5' terminal PmeI restriction
recognition site. The 5' overlapping fragment was made using the following forward primer
and reverse mutagenic primer:

Forward: 5'-AAGCTTGTTTAAACTGGGCATGGTGGGACATGGTTGTAAT-3' (SEQ ID NO: 741)

Reverse: 5'-GGGCTTCCACCCAGTCAGTCCAGTACATGTACCT-3' (SEQ ID NO: 742).

The thermal cycling conditions utilized for thirty cycles are 95°C for 0.5 minute, 55°C for 1
minute and 72°C for 1 minute. The 3' fragment was made using the following forward and
reverse primers:

Forward: 5'-CTGACTGGGTGGAAGCACCCCGGATCGAGC-3' (SEQ ID NO: 743)

Reverse (mutagenic): 5'-GAATTCATCGGTACCTGTGCGGCCGCTTCATTG-3' (SEQ ID NO: 744).

The thermal cycling conditions utilized for thirty cycles are 95°C for 0.5 minute, 55°C for 1
minute and 72°C for 1 minute. The final overlapping PCR used 1 ul each of the 5' fragment
and 3' fragment PCR reactions as template, and amplification was performed using the
forward and reverse primers of the 5' and 3' fragments, respectively, and the same thermal
cycling parameters. The final PCR product was cloned using the pGEM-T-easy T/A cloning
kit. The mutagenized exon 3 was excised from Zmaxl-KI/KO (B) and transferred to
Zmaxl-KI/KO (A) as a 600 bp BsmBI/XbaI fragment.

Probes for screening for and characterization of Zmaxl-KI/KO (A) gene targeted ES
cell clones are prepared by subcloning restriction fragments of BAC clone 473P5. The 5'
outside probe is a 400 bp NdeI-BstEII fragment, and the 3' outside probe is a 500 bp EcoRI-
BstXI fragment.

The outside probes for Zmaxl-KI/KO (B) are prepared by PCR cloning genomic
fragments flanking and immediately adjacent to the targeting vector region of homology.
The 5' outside probe used for Zmaxl-KI/KO (B) is a 498 bp fragment generated using the
forward primer of the sequence (5'-TGAGATGTCCTGTCTGTGGC-3'; SEQ ID NO: 745)
and a reverse primer of the sequence (5'-TCCTTCCTTCCCTACAGTTG-3'; SEQ ID NO:
746). The thermal cycling conditions utilized with these probes for thirty cycles are: 95°C
for 0.5 minute, 55°C for 1 minute and 72°C for 1 minute. The 3' outside probe is a 600 bp
fragment generated using the forward primer of the sequence (5'-CCTAAGGATCTCCTTGT
GTCTGTGG-3'; SEQ ID NO: 747) and a reverse primer of the sequence (5'-CTGCAGCAG
GTCAGTAGCCTGC-3'; SEQ ID NO: 748). The thermal cycling conditions utilized with
these probes for thirty cycles were: 95°C for 0.5 minute, 55°C for 1 minute and 72°C for 1
minute. Both probes are specific for the Zmaxl gene in genomic Southern analysis. PCR
products are cloned using the pGEM-T-easy T/A cloning kit.

A probe for ribonuclease protection analysis of Zmaxl mRNA structure and
transcription levels was prepared by PCR cloning a cDNA fragment containing exon 3
through exon 4. The PCR reaction used a complete cDNA as template, a forward primer of
the sequence (5'-TGAGATGTCCTGTCTGTGGC-3'; SEQ ID NO: 749), a reverse primer of
the sequence (5'-TCCTTCCTTCCCTACAGTTG-3'; SEQ ID NO: 750) and the following
thermal cycling conditions for thirty cycles: 95°C for 0.5 minute, 55°C for 1 minute and 72°C
for 1 minute. The PCR product is cloned using the pGEM-T-easy T/A cloning kit.

One skilled in the art could use similar protocols to generate other engineered KI- 
alleles to produce transgenic animals with an HBM-like phenotype. 

21.6.3 Gene targeting in ES cells 
For gene targeting, embryonic stem (ES) cells are electroporated with 50 µg of
linearized targeting vector and selected in 200 µg/ml G418 for 7-10 days beginning the day
after electroporation. G418 resistant clones are picked, expanded and cryopreserved.
Resistant clones are screened for homologous recombination by an EcoRI genomic
Southern restriction fragment length analysis using the 5' outside probe, which detects the
wild type and targeted alleles of Zmax1 as 4 kb and 5 kb fragments, respectively. Gene
targeted ES cell clones are thawed, expanded, and characterized by ScaI genomic restriction
fragment length analysis using the 3' outside probe, which detects the wild type and targeted
alleles of Zmax1 as 9 kb and 8 kb fragments, respectively. Gene targeted clones are also
characterized by sequence analysis of Zmax1 exon 3 to ensure that the G to T substitution
was included in homologous recombination.
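
The fragment sizes given above for the two outside probes can be summarized as a simple lookup. The following Python sketch is illustrative only and is not part of the protocol described here; the function name, the dictionary layout and the sizing tolerance are assumptions introduced for the example.

```python
# Expected Southern fragment sizes (kb) for each outside probe, as stated in the text.
EXPECTED_KB = {
    "5' outside probe": {"wild type": 4.0, "targeted": 5.0},
    "3' outside probe": {"wild type": 9.0, "targeted": 8.0},
}

def call_alleles(probe, observed_kb, tolerance_kb=0.3):
    """Return the sorted allele names whose expected band matches an observed band.

    tolerance_kb is a hypothetical allowance for gel sizing error.
    """
    expected = EXPECTED_KB[probe]
    found = set()
    for band in observed_kb:
        for allele, size in expected.items():
            if abs(band - size) <= tolerance_kb:
                found.add(allele)
    return sorted(found)

# A heterozygous targeted clone would be expected to show both bands with each probe.
print(call_alleles("5' outside probe", [4.1, 5.0]))
```

A clone showing only the wild-type band with either probe would not be carried forward under this scheme.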

21.6.4 Production of gene targeted mice by blastocyst injection

To generate chimeric mice, gene targeted ES cell clones are thawed and expanded, and
9-14 ES cells are injected into the blastocoel of 3.5 days post coitum (p.c.) host C57BL/6
blastocysts. Injected blastocysts (12-17) are then transferred unilaterally into the uterus of
2.5 days p.c. pseudopregnant Swiss Webster embryo recipients and allowed to develop to term.
Chimeric males are back-crossed to 129SvEv females and tested for transmission of the targeted
allele by PCR genotyping with primers specific to the neomycin resistance gene.

21.6.5 In vitro deletion of the Neomycin resistance cassette via Cre recombinase

To generate Zmax1 (LRP5) KI mice from the Zmax1 KI/KO mice, the Neomycin
resistance (KO) cassette was deleted by microinjection of a Cre expressing plasmid (2
µg/ml) into the male pronucleus of Zmax1 KI/KO pre-fusion zygotes. Deletion of the KO
cassette was confirmed by PCR analysis of the cassette insertion site.

21.6.6 Genotyping transgenic mice 

Genomic DNA was isolated from mouse tail snips by digestion in 500 µl buffer
containing 50 mM Tris-HCl (pH 7.2), 50 mM EDTA (pH 8.0), 0.5% SDS and 0.8 mg/ml
proteinase K. Samples are incubated at 55°C with shaking overnight. A 10 µl aliquot was
heat-inactivated at 99°C for 5 minutes and diluted 1:20 in water. For PCR, 1 µl of the diluted
DNA was amplified under the following conditions: Denature: 96°C for 4.5 min; 45 cycles:
96°C for 30 sec; 63°C for 1 min; 72°C for 1 min; Extension: 72°C for 5 min; 4°C hold.
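
As a planning aid, the programmed hold time of the genotyping PCR can be totalled from the conditions above. This Python sketch is illustrative only; the variable and function names are assumptions, and ramping time between temperatures and the final 4°C hold are ignored.

```python
# Genotyping PCR program from the text; all times in minutes.
INITIAL_DENATURE_MIN = 4.5          # 96°C
CYCLE_STEPS_MIN = [0.5, 1.0, 1.0]   # 96°C, 63°C, 72°C
CYCLES = 45
FINAL_EXTENSION_MIN = 5.0           # 72°C

def program_minutes(initial, steps, cycles, final):
    """Sum programmed hold times, ignoring ramping between temperatures."""
    return initial + cycles * sum(steps) + final

total = program_minutes(INITIAL_DENATURE_MIN, CYCLE_STEPS_MIN, CYCLES, FINAL_EXTENSION_MIN)
print(f"{total:.1f} min")  # prints "122.0 min"
```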

The following primer sets are used for genotyping:

HBMMCBA:

5' primers: 296 bp fragment (SEQ ID NOS: 751-752, respectively)
Forward: 5'-GCT TCT GGC GTG TGA CCG GCG-3'
Reverse: 5'-GCC GCA CAG CGC CAG CAG CAG C-3'

3' primers: 400 bp fragment (SEQ ID NOS: 753-754, respectively)
Forward: 5'-CAC CCA CGC CCC ACA GCC AGT A-3'
Reverse: 5'-ATT TGC CCT CCC ATA TGT CCT TCC-3'

HBMMTIC:

5' primers: 382 bp fragment (SEQ ID NOS: 755-756, respectively)
Forward: 5'-TTC CTC CCA GCC CTC CTC CAT CAG-3'
Reverse: 5'-GCC GCA CAG CGC CAG CAG CAG C-3'

3' primers: 524 bp fragment (SEQ ID NOS: 757-758, respectively)
Forward: 5'-GAA TGG CGC CCC CGA CGA C-3'
Reverse: 5'-GCT CCC ATT CAT CAG TTC CAT AGG-3'

21.6.7 Confirmation of genotype by Southern analysis

Mouse genomic DNA was digested with EcoRI and probed with a 1.0 kb SalI-BamHI
restriction fragment from the Zmax1 cDNA. The probe hybridizes to a 5 kb fragment in
transgene positive animals.

21.7 Phenotyping 

Both in vivo and ex vivo assays are used to evaluate the phenotype in transgenic mice. 
Two strains of wild-type mice, namely C57BL/6 and 129 SvEv, are studied to provide control 
data for phenotypic evaluation in transgenic and gene-targeted mice. In addition, non- 
transgenic littermate animals are used as controls. 

21.7.1 In vivo analysis 

Spinal bone mineral content (BMC) and bone mineral density (BMD) measurements 
performed at Creighton University (Omaha, Nebraska) were made by DXA using a Norland
Instruments densitometer (Norland XR2600 Densitometer, Dual Energy X-ray
Absorptiometry, DXA). Spinal BMC and BMD at other locations used the machinery 
available. There are estimated to be 800 DXA machines currently operating in the U.S. Most 
larger cities have offices or imaging centers which have DXA capabilities, usually a Lunar or 
Hologic machine. Each location that provided spine BMC and BMD data included copies of 
the printouts from their machines to provide verification that the regions of interest for 
measurement of BMD have been chosen appropriately. Complete clinical histories and 
skeletal radiographs were obtained. 

The HBM phenotype (and the HBM-like phenotype, which is also included when
discussing the HBM phenotype) in human and animal subjects, preferably humans, can be
described using criteria such as: very high spinal BMD; a clinical history devoid of any
known high bone mass syndrome; and skeletal radiographs showing a normal shape of the
appendicular skeleton.

pDXA: Wild-type and transgenic mice are anesthetized, weighed and whole-body X-
ray scans of the skeleton generated using the LUNAR small animal PIXImus device. Scans
are begun when the mice are weaned (i.e., at 3 weeks of age) and repeated at 2-week
intervals. Wild-type animals are scanned at 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 27, and 29
weeks. Scanning of transgenic animals would be performed for periods up to 17 weeks.
Scans are analysed for BMD (bone mineral density), BMC (bone mineral content), TTM
(total tissue mass), and % fat for various body regions.

Faxitron radiographs: Following pDXA scanning of anesthetized animals, an 
additional X-ray was taken using a Faxitron device allowing measurement of bone size. 

Calcein labeling: Animals are dosed with 15 mg/kg calcein intraperitoneally on two 
consecutive occasions. The first dose was given 9 days before euthanasia and the second 
given 2 days before euthanasia allowing measurement of bone formation rate. 

21.7.2 Ex vivo analysis 

RNA isolation: Total RNA was isolated from tibia and other tissues using TRIzol® to 
determine mRNA expression.

pQCT: The right femur was cleaned of soft tissue and stored in 70% ethanol for
determination of total and trabecular density of the distal metaphysis and cortical density of 
the mid-shaft. 

MicroCT: The right femur was used to determine trabecular indices of the distal
metaphysis.

Histology: The right femur was used to determine bone area and static and dynamic 
parameters of the distal metaphysis. 

Bending strength: The left femur was cleaned of soft tissue and stored at -20°C prior
to analysis of 3-point bending strength of the mid-shaft.

Compressive strength of vertebra: The entire spine was removed from T10-L6-7.
Soft tissue was left on and the spine frozen at -20°C until analysis. Compressive strength was
measured at the L5 vertebra.

Serum: Animals are euthanized and serum prepared from blood to measure total
cholesterol, triglycerides, osteocalcin and other biochemical markers.

Lysis: Examples include immunocytochemistry, such as in situ hybridization of
osteogenic markers and TUNEL staining of cells undergoing apoptosis.

21.8 Results 

21.8.1 Confirmation of expression from transgenic plasmid constructs

The HBM (HBMMCBA and HBMMTIC) and wild-type (Zmax1WTCBA and
Zmax1WTTIC) plasmid constructs were transiently transfected into HOB-02-02 cells, which
have a very low endogenous level of Zmax1 (LRP5) expression. Two days after transfection,
RNA was isolated and TaqMan® quantitative RT-PCR was performed to determine the
mRNA levels of Zmax1/HBM in the cells. To control for contaminating plasmid DNA, PCR
was performed with or without the prior RT step. In the absence of the RT step, only very low
levels of Zmax1/HBM mRNA were detected. However, with the RT step, a 1000-fold
increase in HBM and Zmax1 mRNA was observed in cells transfected with CMVβActin
promoter constructs as compared to those transfected with the CMV-β-galactosidase control.
The type I collagen promoter constructs showed approximately 10-fold increases in HBM
and Zmax1 mRNAs, which is consistent with the weaker nature of this promoter compared to
the CMVβActin promoter. See Fig. 17.

21.8.2 Species specific TaqMan® reagents for HBM/Zmax1 expression

Species specific TaqMan® primer and probe sets for Zmax1/HBM were developed.
In a series of experiments using HOB cells and mouse MC-3T3-E1 osteoblastic cells,
human Zmax1/HBM mRNA was measured in a mouse background, and vice versa. These
reagents are useful for the detection and quantitation of species-specific expression. As
demonstrated in Fig. 18, the primer sets are species specific in the mouse and human cell
lines. Further, Fig. 19 demonstrates the quantitative measurement of human Zmax1 RNA in a
background of mouse RNA. These TaqMan® sets can be used to determine the levels of human
or mouse HBM or Zmax1 (or other HBM-like variant) message that are being expressed in the
mouse transgenic lines.

The species-specific TaqMan® reagents are novel tools for the characterization of both
endogenous Zmax1 mRNA levels and human Zmax1/HBM mRNA levels in the transgenic
mouse tissues. These tools have several advantages over other conventional methods, such as
Northern hybridization and standard RT-PCR. Some of these advantages are as follows: (1)
specificity, since only a small region (<100 bp) is amplified and primers and probes are chosen
to sequence regions predicted to have no or minimal cross-reactivity; (2) speed, since the
procedure is less labor intensive; (3) accuracy, since it is truly quantitative; and (4)
sensitivity, since it requires only small amounts of starting material (i.e., RNA) and the
signal-to-noise ratio is high. These advantages are especially important for analyzing mRNA
levels in bone, because it is difficult to obtain large amounts of RNA from bone. Thus, the
primer sets developed for TaqMan® analysis of HBM and Zmax1 expression are important
embodiments of the present invention. One skilled in the art will recognize that the primers
described here are preferred embodiments; modifications such as extension or truncation of a
primer or base substitution are encompassed by the present invention so long as the resultant
nucleic acid continues to perform substantially the same function.

21.8.3 HBM expression in transgenic mice

Eight transgenic founder animals were produced for the CMVβActin (HBMMCBA)
construct and a breeding program initiated to establish lines. Expression of mRNA
determined by TaqMan® analysis, shown in Fig. 20, showed variable levels of bone
expression in 4 lines. In tibia, expression levels (relative to endogenous Zmax1 (LRP5) in
HOB-03-C5 cells) showed the following range: line 18 (×10-11 fold); line 2 (×7-10 fold); line
13 (×1-2 fold) and line 28 (×1 fold). Expression was also detected in other tissues as
expected based on the known activity of the promoter. For lines 2 and 13, the highest levels
of HBM expression were found in the heart. A TaqMan® genotyping assay will screen for
potential homozygous animals.

Six transgenic founder animals were produced for the type I collagen (HBMMTIC) 
construct, and a breeding program initiated to establish lines. Expression of mRNA was 
found in two lines initially tested. In line 19, expression was 7-8 fold and 19-20 fold greater 
than Zmax1 in HOB-03-C5 cells in tibia and femur, respectively. In line 35, a low level of
expression was detected in tibia and femur. 

21.9 In vivo pDXA in HBM transgenic mice

21.9.1 HBMMCBA construct

Initial analysis of limited numbers of mice, illustrated in Fig. 21 (A-C), at the 5 week
and 9 week time-points showed that one line tested to date had greater BMD values compared
to control. At 5 weeks, HBMMCBA line 2 (n=11) BMD in femur, spine and total body was
21%, 24% and 10% greater, respectively, than wild-type control. At 9 weeks (n=3), these
increases in BMD amounted to 19%, 32% and 12%, respectively.

21.9.2 HBMMTIC construct

Analysis of limited numbers of mice, illustrated in Fig. 21 (D-F), at the 5 week and 9
week time points showed that two lines tested to date had greater BMD values as compared
to control. At 5 weeks, HBMMTIC line 19 (n=5) BMD in femur, spine and total body was
63%, 70% and 41% greater, respectively, than the wild-type control. At 9 weeks (n=2), these
increases in BMD amounted to 52%, 64% and 37%, respectively.

At 5 weeks, HBMMTIC line 35 (n=1) BMD in femur, spine and total body was 4%,
47% and 6% greater, respectively, than the wild-type control. At 9 weeks (n=3), these
increases in BMD amounted to 32%, 43% and 18%, respectively.
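
The percentages reported in this section express a transgenic group value relative to the wild-type control. A minimal Python sketch of that arithmetic follows, assuming group means are compared; the function name is an assumption and the BMD values in the usage line are hypothetical placeholders, not measured data.

```python
def percent_greater(transgenic_mean, control_mean):
    """Percent by which the transgenic group mean exceeds the control mean."""
    if control_mean <= 0:
        raise ValueError("control mean must be positive")
    return 100.0 * (transgenic_mean - control_mean) / control_mean

# Hypothetical spine BMD means (g/cm^2), for illustration only.
print(round(percent_greater(0.0705, 0.0500), 1))
```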

These results indicate that three lines analyzed to date show evidence for a high bone 
mass phenotype. There appeared to be no obvious correlation between levels of mRNA
expression and BMD phenotype from the limited numbers of lines and animals studied to 
date. For example, HBMMCBA line 2 and HBMMTIC line 19 have similar levels of HBM 
mRNA in tibia, but the phenotype is more evident in line 19. Also, HBMMTIC line 35 
shows a very low level of expression when compared to HBMMCBA line 2, but HBMMTIC 
line 35 appears to have a stronger phenotype. These observations point to possible 
differences in cellular expression that may impact the phenotype. 

Overall, the BMD results from the transgenic mice show similarities in magnitude to
the phenotype observed in the HBM affected kindred (Johnson et al., 1997, Am. J. Hum.
Genetics, 60:1326-1332). For example, spinal BMD measured in affected individuals is
approximately 34-63% greater than non-affected family members. The data for spinal BMD
from the transgenic animals ranges from ~30-70% greater than normal at 9 weeks of age.

21.9.3 Ex vivo analysis of transgenic mice

In order to further examine increases in bone density that were detected in select 
transgenic lines through monitoring of the animals by non-invasive bone imaging, necropsies 
were performed on animals of these lines at 5 and 9 weeks of age for direct bone 
densitometric and histologic analysis. The left femur was isolated, cleaned and positioned in 
an XCT Research peripheral Quantitative Computed Tomograph (pQCT; Stratec
Medizintechnik, Pforzheim, Germany). The distal end of the femur was located and pQCT 
scanning was initiated 2.5 mm proximal from this point for total and trabecular 
measurements. The pQCT scan for cortical measurements was initiated 3.5 mm proximal 
from the first scan (i.e., 6 mm proximal from the distal end). The pQCT scans were 0.5 mm 
thick, had a voxel (i.e., three dimensional pixel) size of 0.07 mm, and consisted of 360
projections through the slice. After the scans were completed, the images were displayed on
the monitor and a region of interest, including the entire femur for each scan, was outlined. 
The soft tissue was automatically removed using an iterative algorithm, and the density of the 
remaining bone (total density) in the first slice was determined. The outer 55% of the bone 
was then peeled away in a concentric spiral and the density of the remaining bone (trabecular 
density) of the first slice was reported in mg/cm³. In the second slice, the boundary between
cortical and trabecular bone was determined using an iterative algorithm, and the density of 
the cortical bone was determined. 
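
The slice geometry described above can be checked with a few lines of arithmetic. The Python sketch below is illustrative only: the names are assumptions, and it assumes the 55% peel refers to a fraction of the bone cross-section, which the text does not state explicitly.

```python
DISTAL_OFFSET_MM = 2.5     # first slice: total and trabecular measurements
SECOND_SLICE_GAP_MM = 3.5  # second (cortical) slice, measured from the first
PEEL_FRACTION = 0.55       # outer portion of bone removed before trabecular density

def slice_positions_mm(first=DISTAL_OFFSET_MM, gap=SECOND_SLICE_GAP_MM):
    """Distances of the two pQCT slices from the distal end of the femur."""
    return first, first + gap

def retained_fraction(peel=PEEL_FRACTION):
    """Fraction of the bone retained for the trabecular density measurement."""
    return 1.0 - peel

print(slice_positions_mm())  # prints (2.5, 6.0)
```

The second position agrees with the 6 mm distance from the distal end stated in the text.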

Analysis of Line 2 F1 CMVβActin-HBM 5 week old transgenic animals revealed that
total density, trabecular density and cortical density were 20%, 37% and 4% higher,
respectively, in the transgenic male mice versus the non-transgenic males. In 5 week old
Line 19 type I collagen-HBM transgenic males, an even more dramatic increase in bone
density over their non-transgenic littermates was evident. Total density, trabecular density
and cortical density were 53%, 104% and 5% higher, respectively. In the Line 19 animals,
the phenotype was found to be maintained at 9 weeks of age with total, trabecular and cortical
bone density being 44%, 101% and 6% higher than the non-transgenic littermates. At 17
weeks, total and trabecular density were increased 46% and 202%, respectively. The effects on
the total and trabecular parameters in line 19 at all three time points were statistically
significant (p<0.001). A somewhat different pattern of bone phenotypic expression
was evident from males of type I collagen-HBM transgenic Line 35. At 5 weeks of age total, 
trabecular and cortical density were only marginally higher (7%, 4% and 4%, respectively). 
However, at 9 weeks of age a clear and statistically significant (P < 0.05, 0.005, and 0.001 
respectively) increase in these parameters became evident (i.e., 25%, 37% and 4%, 
respectively). The occurrence of different patterns of age-related expression of the phenotype 
is not unexpected, particularly with the "bone specific" type I collagen transgene, which is 
influenced by stage of bone cell differentiation. Both Line 2 and Line 19 animals at 5 weeks 
of age express comparable levels of HBM mRNA in tibia samples, and these levels are 
significantly greater (>7-8-fold) than other lines that show no apparent bone phenotype at this 
age. Line 19, which is driven by the type I collagen promoter, unlike line 2, shows very low
expression of the transgene in tissues other than bone. At 5 weeks of age, Line 35 animals
show low level expression in bone and none in other tissues. Immunohistochemistry of
calvarial bone sections using an HBM/Zmax1 specific antibody reveals much more intense
staining in bone cells of transgenic animals from Lines 2 and 19 at 5 weeks of age and from 
Line 35 at 9 weeks of age versus their non-transgenic littermates. 

The findings revealed by pQCT analysis were further examined under greater 
resolution using µCT instrumentation (Scanco). The femur was positioned such that the
region being imaged includes the distal end of the femur extending approximately 4 mm
proximally with the view being perpendicular to the axis of the articulating cartilage. The
reference line for beginning the µCT measurement was placed to minimally overlap the
growth plate and extends proximally for 200 scan slices (9 µm thickness). After completing
the µCT measurement, the first slice in which the condyles have fully merged was identified.
A region of interest was outlined to include a maximum amount of the trabecular space, while 
excluding the cortex. For the first thirty slices, regions of interest were drawn every five 
slices and merged. For the remaining 105 slices, regions of interest were drawn every 10-20 
slices. The more regular the trabecular space, the less frequently a region of interest needed 
to be drawn. Each region of interest was merged with its predecessor after it was drawn. 
After regions of interest had been established for all 135 slices, three dimensional evaluation 
was performed using a threshold setting of 350. 

The increased bone densities identified by pQCT were confirmed and extended by
µCT to include elements of bone architecture. In the Line 2 transgenic animals, µCT bone
volume/total volume, connectivity density and trabecular thickness were 50%, 83% and 12%
higher, respectively. Both the connectivity density and trabecular thickness indices suggest
that the increased density is also associated with increased structural strength. Bone surface/
bone volume was lower by 17% in the transgenic males, which suggests that there may be
fewer resorptive surfaces and pits. The trabecular bone response was further confirmed by
histological evaluation of non-decalcified, Goldner's stained sections, which revealed 36%
greater bone mineral area in the distal femoral metaphysis of the transgenic males. Dynamic
histomorphometric analysis revealed that a substantial increase in bone mineral apposition
rate (+100%), as determined by calcein double labeling, may be partially responsible for the
increased bone in the transgenics. The dramatic effects evident by pQCT on trabecular bone
in Line 19 were supported by µCT evaluation where bone volume/total volume, trabecular
number, trabecular thickness and connectivity density were found to be 130%, 45%, 30% and
121% higher, respectively, in the transgenic males. As seen in Line 2, the bone surface/bone
volume was lower in the transgenic males (-36%). All of these effects were statistically
significant with p < 0.01. The bone phenotype seen at 5 weeks of age in Line 19 was
maintained in 9 week-old animals where bone volume/total volume, trabecular number and
connectivity density were 125%, 38% and 110% higher than in the non-transgenic
littermates.

µCT analysis of the Line 35 transgenics revealed a somewhat different pattern than
the other two lines. In contrast to only modestly increased density indicated by pQCT in 5
week-old females from Line 35, a statistically significant effect (p < 0.01) was seen with
µCT, which has greater image resolution and encompasses a larger volumetric sample. Bone
volume/total volume, trabecular thickness and connectivity density were 35%, 9% and 27%
higher. A similar result was seen in 5 week-old males from Line 35 where bone volume/total
volume and connectivity density increases of 37% and 45%, respectively, were evident by
µCT analysis, where only slight increases were revealed by pQCT. The differences between
the Line 35 transgenic males and their non-transgenic littermates appeared to increase with
age such that statistically significant increases in total density (25%) and trabecular density
(37%) were evident by pQCT at 9 weeks of age. The µCT results support an age-related
divergence in bone phenotype in this line and show that differences between transgenic and
non-transgenic animals, in terms of bone volume/total volume and connectivity density, had
nearly doubled those seen at 5 weeks to 70% and 83%, respectively. The bone volume
increases seen in the transgenic animals are in agreement with a significant increase in this
parameter that was detected in a bone biopsy sample from an adult male affected member of
the HBM kindred. The other parameters that were found to be affected in the transgenic lines
may reflect changes that lead to an increased peak/adult bone mass, which in this strain of
mice occurs between the ages of 17-20 weeks.

The bone density and bone architectural changes seen in the over-expressing
transgenic lines would suggest potentially greater bio-mechanical strength. This was tested 
directly by evaluating 3-point bending strength of femurs from 5 week old Line 19 males. 
The femora were cleaned of soft tissue and the femoral length measured using a digital 
caliper. Periosteal and endosteal circumferences, as well as cortical thickness, were measured 
6 mm from the distal end of the bone using pQCT. The femur was then placed on a fixture so 
that the center of mid-shaft was at an equal distance from fixed supports located 5 mm apart. 
The cross bar of an Instron 5543 load device was placed over the mid-shaft and a force 
applied at a speed of 1 mm/minute until fracture occurred. A force vs. displacement curve 
was generated and peak load determined using Instron Merlin software. There was a 75% 
increase (p<0.01) in strength that appears to be due to an increase in periosteal circumference 
leading to an increase in cortical thickness. Thus, it appears that the changes in bone density 
and bone geometry, as seen in the HBM transgenic animals, do translate into increases in 
biomechanical strength. 
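
Peak load is the maximum of the force vs. displacement curve. The analysis described above was performed with Instron Merlin software; the Python sketch below is only a stand-in showing the same idea, with an assumed function name and a hypothetical curve that does not represent measured data.

```python
def peak_load(displacement_mm, force_n):
    """Return (peak force, displacement at peak) from paired curve samples."""
    if len(displacement_mm) != len(force_n) or not force_n:
        raise ValueError("need equal-length, non-empty series")
    # Index of the largest force sample; the first maximum wins on ties.
    i = max(range(len(force_n)), key=force_n.__getitem__)
    return force_n[i], displacement_mm[i]

# Hypothetical curve: load rises until fracture, then drops.
disp = [0.0, 0.1, 0.2, 0.3]
force = [0.0, 5.0, 12.0, 3.0]
print(peak_load(disp, force))  # prints (12.0, 0.2)
```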

In view of the association of HBM/Zmax1 within the class of LDL related receptor
proteins, it was of interest to determine whether the mutation might affect lipid profiles. 
Indeed, lipid studies in the HBM kindred (i.e., 8 affected and 7 unaffected members) have 
revealed that triglyceride and VLDL levels are statistically lower in the affected members. 
Serum samples from the transgenic lines were analyzed on a Hitachi 911 instrument using
Boehringer Mannheim (for cholesterol) and Roche (for triglycerides) reagents. The
cholesterol was measured via o-quinone imine dye (which is formed following enzymatic
reactions with cholesterol) photometrically at 505 nm at 37°C. Enzymatic methods for
triglyceride measurements are based on determination of the glycerol part of triglyceride after 
hydrolysis of triglycerides and fatty acids. The end dye product of enzymatic reaction was 
measured at 505 nm. In 5 week old male Line 2 transgenics, although serum cholesterol was 
only slightly reduced, serum triglyceride levels were reduced by 26% in the transgenics 
versus their non-transgenic littermates. In a limited sample of Line 2 animals at 9 weeks of 
age, triglyceride levels remained 20% lower. Similarly, at 5 weeks of age triglyceride levels 
in male transgenics from Line 19 were 32% lower. In contrast, at 5 weeks of age both male
and female transgenics of Line 35 did not have lower triglyceride levels. The fact that the 5
week old Line 35 animals did have statistically greater bone volume/total volume suggests
that the lipid change may not be directly related to the skeletal phenotype. This would appear
to be supported by the fact that the Line 35 animals at 9 weeks of age had only slightly
reduced triglyceride levels (11%) but exhibited substantially higher bone density than at 5
weeks of age. Due to the different levels and sites of expression of the transgene in these
lines we cannot rule out the possibility that serum lipid levels could serve as a surrogate
marker for agents favorably affecting a bone phenotype through HBM/Zmax1.

These and other transgenic lines based on HBM or HBM-like genes will serve as
valuable models for exploring the nature of bone homeostasis. Bone density in all species
accommodates to its customary loading conditions. In the HBM kindred and in the
transgenic lines, the sensor/effector systems of the skeleton appear to perceive greater load
signals resulting in greater bone density. Experimental models have been established
showing that increased bone loading can lead to increased bone density and that unloading or
disuse leads to a loss of bone density. Evaluating the histological, biochemical and genetic
responses of the skeleton of the transgenic animals in these experimental paradigms will yield
much information regarding the sensor/effector system responsible for bone homeostasis.
The application of the transgenic animals in other established models of altered bone
turnover, including but not limited to steroid deficiency-induced osteopenia and aging-related
osteopenia, will provide further insight into the role of Zmax1 in bone homeostasis and the
nature of the favorable changes induced by the HBM mutation.

21.9.4 Gene targeting

The Zmax1 KI/KO gene-targeting vector is electroporated into 129 SvEv, C57BL/6
ES and 129 ES cells. Restriction fragment length analysis of genomic DNA and sequencing
of PCR amplified fragments can be used to identify gene targeted clones. The knock-in
version of the gene-targeting vector allows for the introduction of the HBM mutation into the
endogenous Zmax1 (LRP5) genomic locus with minimal impact on the mouse genome. It
permits the production of the HBM protein in a more natural environment, i.e. not in an over-
expression model such as the transgenic mice or transfected cell lines. The knock-out version
of the gene-targeting vector was engineered to contain a transcriptional stop sequence that has 
the potential to result in loss of one functional LRP5 allele. Breeding heterozygous animals 
with this mutation would lead to the production of embryos homozygous for the null allele. 
In a different design of the gene-targeting vector, lox P sites could be positioned in such a 
way that would facilitate production of a conditional knock-out of the endogenous LRP5 
gene. In the presence of Cre recombinase, a critical region of the LRP5 gene would be 
deleted in between the lox P sites, thus resulting in the potential loss of one functional allele. 
Animal breeding would then be used to create homozygotes with a null allele. Other 
recombinase enzyme systems, such as flp recombinase in combination with cognate frt sites,
could be used to create the deletion. The recombinase could be administered in a number of 
ways as described earlier, including plasmid injection into embryos and use of transgenic 
animals expressing Cre. The promoter used to drive expression of Cre could be chosen in a 
manner that would result in ubiquitous or tissue-specific deletion of the LRP5 gene thus 
resulting in a conditional knockout. In a further embodiment expression of the Cre enzyme 
itself could be made conditional using inducible systems such as GeneSwitch and 
Tetracycline paradigms. 

21.9.5 Uses of transgenic animals and cells 

The transgenic animals and cells of the present invention are useful tools in methods 
for identifying surrogate markers for the HBM phenotype. The surrogate markers provided 
by the present invention are also useful tools for the assessment and screening of prospective 
treatments. Individuals carrying the HBM gene have elevated bone mass. The HBM gene 
causes this phenotype by altering the activities, levels, expression patterns, and modification 
states of other molecules involved in bone development. Using a variety of established 
techniques, it is possible to identify molecules, preferably proteins or mRNAs, whose 
activities, levels, expression patterns, and modification states are different between systems 
containing the Zmax1 gene and systems containing the HBM gene. Such systems can be, for
example, cell-free extracts, cells, tissues or living organisms, such as mice or humans. For a
mutant form of Zmax1, a complete deletion of Zmax1, mutations lacking the extracellular or
intracellular portion of the protein, or any other mutation in the Zmax1 gene may be used. It
is also possible to use expression of antisense Zmax1 RNA or oligonucleotides to inhibit
production of the Zmax1 protein. For a mutant form of HBM, a complete deletion of HBM,
mutations lacking the extracellular or intracellular portion of the HBM protein, or any other
mutation in the HBM gene may be used. It is also possible to use expression of antisense
HBM RNA or oligonucleotides to inhibit production of the HBM protein.

Molecules identified by comparison of Zmax1 systems and HBM systems can be used
as surrogate markers in pharmaceutical development or in diagnosis of human or animal bone
disease. Alternatively, such molecules may be used in treatment of bone disease. See,
Schena et al., 1995 Science, 270: 467-70.

For example, a transgenic mouse carrying the HBM gene or HBM-like variant in the 
mouse homologue locus is constructed. A mouse of the genotype HBM/+ is viable, healthy 
and has elevated bone mass. To identify surrogate markers for elevated bone mass, HBM/+ 

15 (i.e., heterozygous) and isogenic +/+ (i.e., wild-type) mice are sacrificed. Bone tissue mRNA 
is extracted from each animal, and a "gene chip" corresponding to mRNAs expressed in the 
+/+ individual is constructed. mRNA from different tissues is isolated from animals of each 
genotype, reverse-transcribed, fluorescently labeled, and then hybridized to gene fragments 
affixed to a solid support. The ratio of fluorescent intensity between the two populations is 

20 indicative of the relative abundance of the specific mRNAs in the +/+ and HBM/+ animals. 
Alternatively, mRNA may be isolated from wild-type and transgenic animals. cDNA 
prepared from these samples is transcribed in vitro to obtain labeled mRNA for use on 
custom made or commercially available gene array chips such as are manufactured by 
Affymetrix. Sets of genes with altered expression as a function of phenotype may be 
identified by a variety of routine computational analyses. Genes encoding mRNA over- and 
under-expressed relative to the wild-type control are candidates for genes coordinately 
regulated by the HBM gene or HBM-like gene. 
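
The ratio-based comparison described above can be sketched in a few lines. This is a minimal illustration only: the gene names, the intensity values, the background floor, and the two-fold (log2 = 1) cutoff are all assumptions made for the example and are not taken from the specification.

```python
import math


def log2_ratios(hbm_intensity, wt_intensity, floor=1.0):
    """Return {gene: log2(HBM/+ signal / +/+ signal)} for genes found in both maps."""
    ratios = {}
    for gene, hbm_signal in hbm_intensity.items():
        if gene not in wt_intensity:
            continue
        # Clamp both signals to a small floor so background-level spots
        # cannot cause a division by zero (the floor is an assumed value).
        ratios[gene] = math.log2(max(hbm_signal, floor) / max(wt_intensity[gene], floor))
    return ratios


def classify(ratios, threshold=1.0):
    """Split genes into over- and under-expressed lists at |log2 ratio| >= threshold."""
    over = sorted(g for g, r in ratios.items() if r >= threshold)
    under = sorted(g for g, r in ratios.items() if r <= -threshold)
    return over, under


# Hypothetical spot intensities (arbitrary fluorescence units).
hbm = {"geneA": 800.0, "geneB": 100.0, "geneC": 410.0}
wt = {"geneA": 200.0, "geneB": 400.0, "geneC": 400.0}

over, under = classify(log2_ratios(hbm, wt))
print("over-expressed in HBM/+:", over)
print("under-expressed in HBM/+:", under)
```

Working in log2 space makes a four-fold increase and a four-fold decrease symmetric around zero, which is why a single threshold can be applied to both directions.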

One standard procedure for identification of new proteins that are part of the same 
signaling cascade as an already-discovered protein is as follows. Cells are treated with 
radioactive phosphorus, and the already-discovered protein is manipulated to be more or less 
active. The phosphorylation state of other proteins in the cell is then monitored by 
polyacrylamide gel electrophoresis and autoradiography, or similar techniques. Levels of 
activity of the known protein may be manipulated by many methods, including, for example, 
comparing wild-type and mutant proteins, using specific inhibitors such as drugs or antibodies, 
simply adding or not adding a known extracellular protein, or using antisense inhibition of 
the expression of the known protein (Tamura et al., 1998 Science, 280: 1614-7; Meng, 1998 
EMBO J., 17: 4391-403; Cooper et al., 1982 Cell 1: 263-73). 

In another example, proteins with different levels of phosphorylation are identified in 
TE85 osteosarcoma cells expressing either a sense or antisense cDNA for Zmax1. TE85 cells 
normally express high levels of Zmax1 (Dong et al., 1998 Biochem. & Biophys. Res. Comm., 
251: 784-90). Cells containing the sense construct express even higher levels of Zmax1, 
while cells expressing the antisense construct express lower levels. Cells are grown in the 
presence of 32P, harvested, and lysed; the lysates are run on SDS polyacrylamide gels to separate 
proteins, and the gels are subjected to autoradiography (Ausubel et al., Current Protocols in 
Molecular Biology, John Wiley & Sons (1997)). Bands that differ in intensity between the 
sense and antisense cell lines represent phosphoproteins whose phosphorylation state or 
absolute level varies in response to levels of Zmax1. As an alternative to 32P-labeling, 
unlabeled proteins may be separated by SDS-PAGE and subjected to immunoblotting, using 
a commercially available anti-phosphotyrosine antibody as a probe (Thomas et al., 1995 
Nature, 376: 267-71). As an alternative to the expression of antisense RNA, transfection 
with chemically modified antisense oligonucleotides can be used (Woolf et al., 1990 Nucleic 
Acids Res., 18: 1763-9). 

Many bone disorders, such as osteoporosis, have a slow onset and a slow response to 
treatment. It is therefore useful to develop surrogate markers for bone development and 
mineralization. Such markers can be useful in developing treatments for bone disorders, and 
for diagnosing patients who may be at risk for later development of bone disorders. 
Examples of preferred markers are N- and C-terminal telopeptide markers described, for 
example, in U.S. Patent Nos. 5,455,179, 5,641,837 and 5,652,112, the disclosures of which 
are incorporated by reference herein in their entirety. In the area of HIV disease, CD4 counts 
and viral load are useful surrogate markers for disease progression (Vlahov et al, 1998 
JAMA, 279(1): 35-40). There is a need for analogous surrogate markers in the area of bone 
disease. 

A surrogate marker can be any characteristic that is easily tested and relatively 
insensitive to non-specific influences. For example, a surrogate marker can be a molecule 
such as a protein or mRNA in a tissue or in blood serum. Alternatively, a surrogate marker 
may be a diagnostic sign such as sensitivity to pain, a reflex response or the like. 

In yet another example, surrogate markers for elevated bone mass are identified using 
a pedigree of humans carrying the HBM gene or HBM-like gene. Blood samples are 
withdrawn from three individuals that carry the HBM gene or HBM-like gene, and from three 
closely related individuals that do not. Proteins in the serum from these individuals are 
electrophoresed on a two dimensional gel system, in which one dimension separates proteins 
by size, and another dimension separates proteins by isoelectric point (Epstein et al., 1996 
Electrophoresis 17: 1655-70). Spots corresponding to proteins are identified. A few spots 
are expected to be present in different amounts or in slightly different positions for the HBM 
individuals compared to their normal relatives. These spots correspond to proteins that are 
candidate surrogate markers. The identities of the proteins are determined by 
microsequencing, and antibodies to the proteins can be produced by standard methods for use 
in diagnostic testing procedures. Diagnostic assays for HBM proteins or other candidate 
surrogate markers include using antibodies described in this invention and a reporter 
molecule to detect HBM in human body fluids, membranes, bones, cells, tissues or extracts 
thereof. The antibodies can be labeled by joining them covalently or noncovalently with a 
substance that provides a detectable signal. In the scientific and patent literature, a variety 
of reporter molecules or labels are described, including radionuclides, enzymes, and fluorescent, 
chemiluminescent or chromogenic agents (U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 
3,996,345; 4,277,437; 4,275,149; and 4,366,241). The transgenic or genetically modified 
animals can also serve in a method for surrogate marker identification. 

Using these antibodies, the levels of candidate surrogate markers are measured in 
normal individuals and in patients suffering from a bone disorder, such as osteoporosis, 
osteoporosis pseudoglioma, Engelmann's disease, Ribbing's disease, hyperphosphatasemia, 
Van Buchem's disease, melorheostosis, osteopetrosis, pycnodysostosis, sclerosteosis, 
osteopoikilosis, acromegaly, Paget's disease, fibrous dysplasia, tubular stenosis, osteogenesis 
imperfecta, hypoparathyroidism, pseudohypoparathyroidism, pseudopseudohypoparathyroid- 
ism, primary and secondary hyperparathyroidism and associated syndromes, hypercalciuria, 
medullary carcinoma of the thyroid gland, osteomalacia and other diseases. Techniques for 
measuring levels of protein in serum in a clinical setting using antibodies are well 
established. A protein that is consistently present in higher or lower levels in individuals 
carrying a particular disease or type of disease is a useful surrogate marker. 

A surrogate marker can be used in diagnosis of a bone disorder. For example, 
consider a child that presents to a physician with a high frequency of bone fracture. The 
underlying cause may be child abuse, inappropriate behavior by the child, or a bone disorder. 
To rapidly test for a bone disorder, the levels of the surrogate marker protein are measured 
using the antibody described above. 

Levels or modification states of surrogate markers can be measured as indicators of 
the likely effectiveness of a drug that is being developed. It is especially convenient to use 
surrogate markers in creating treatments for bone disorders, because alterations in bone 
development or mineralization may require a long time to be observed. For example, a set of 
bone mRNAs, termed the "HBM-inducible mRNA set" is found to be over-expressed in 
HBM/+ mice as compared to +/+ mice, as described above. Expression of this set can be 
used as a surrogate marker. Specifically, if treatment of +/+ mice with a compound results in 
over-expression of the HBM-inducible mRNA set, then that compound is considered a 
promising candidate for further development. 
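
One way to turn the "HBM-inducible mRNA set" criterion into a number is to report the fraction of the set that is induced in treated +/+ animals relative to untreated controls. The sketch below is illustrative only; the marker names, expression values, and the two-fold induction cutoff are assumptions, not values from the specification.

```python
def induced_fraction(marker_set, treated, control, min_fold=2.0):
    """Fraction of marker-set mRNAs whose treated/control ratio reaches min_fold."""
    if not marker_set:
        raise ValueError("marker_set must not be empty")
    hits = 0
    for gene in marker_set:
        # Skip markers with no measurable control signal rather than divide by zero.
        if control[gene] > 0 and treated[gene] / control[gene] >= min_fold:
            hits += 1
    return hits / len(marker_set)


# Hypothetical expression levels (arbitrary units) for a four-member marker set.
marker_set = ["m1", "m2", "m3", "m4"]
control = {"m1": 100.0, "m2": 100.0, "m3": 100.0, "m4": 100.0}
treated = {"m1": 250.0, "m2": 300.0, "m3": 210.0, "m4": 110.0}

score = induced_fraction(marker_set, treated, control)
print(f"{score:.2f} of the marker set is induced at least two-fold")
```

A compound could then be ranked against others by this score, with the cutoff for "promising" chosen by the investigator.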

This invention is particularly useful for screening compounds by using the Zmaxl or 
HBM protein or HBM-like proteins or binding fragments thereof in any of a variety of drug 
screening techniques. 

The Zmaxl or HBM protein or fragment employed in such a test may either be free in 
solution, affixed to a solid support, or borne on a cell surface. One method of drug screening 
utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant 
nucleic acids expressing the protein or fragment, preferably in competitive binding assays. 
Such cells, either in viable or fixed form, can be used for standard binding assays. One may 
measure, for example, the formation of complexes between a Zmaxl or HBM protein or 
fragment and the agent being tested, or examine the degree to which the formation of a 
complex between a Zmaxl or HBM protein or fragment and a known ligand is interfered with 
by the agent being tested. 

Thus, the present invention provides methods of screening for drugs comprising 
contacting such an agent with a Zmaxl (LRP5) or HBM protein or fragment thereof and 
assaying (i) for the presence of a complex between the agent and the Zmaxl or HBM protein 
or fragment, or (ii) for the presence of a complex between the Zmaxl or HBM protein or 
fragment and a ligand, by methods well known in the art. In such competitive binding assays 
the Zmaxl or HBM protein or fragment is typically labeled. Free Zmaxl or HBM protein or 
fragment is separated from that present in a protein:protein complex, and the amount of free 
(i.e., uncomplexed) label is a measure of the binding of the agent being tested to Zmaxl or 
HBM or its interference with Zmaxl or HBM:ligand binding, respectively. 
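
The arithmetic behind case (ii) can be illustrated as follows: the free label is converted to a bound fraction, and the bound fraction in the presence of the test agent is compared with the bound fraction in its absence. This is a sketch under stated assumptions; the function names and the count values are hypothetical, and a real assay would also subtract background and use replicates.

```python
def bound_fraction(free_label, total_label):
    """Fraction of labeled protein that is in complex, given free and total label."""
    if total_label <= 0 or not 0 <= free_label <= total_label:
        raise ValueError("free label must lie between 0 and a positive total")
    return 1.0 - free_label / total_label


def percent_inhibition(free_with_agent, free_without_agent, total_label):
    """Percent loss of protein:ligand complex caused by the test agent."""
    bound_ref = bound_fraction(free_without_agent, total_label)
    if bound_ref <= 0:
        raise ValueError("no complex forms in the reference well")
    bound_agent = bound_fraction(free_with_agent, total_label)
    return 100.0 * (1.0 - bound_agent / bound_ref)


# Hypothetical counts: 1000 units of labeled protein per well.
inhibition = percent_inhibition(free_with_agent=600.0,
                                free_without_agent=200.0,
                                total_label=1000.0)
print(f"{inhibition:.0f}% inhibition of ligand binding")
```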

Another technique for drug screening provides high throughput screening for 
compounds having suitable binding affinity to the Zmaxl or HBM proteins and is described 
in detail in WO 84/03564. Briefly stated, large numbers of different small peptide test 
compounds are synthesized on a solid substrate, such as plastic pins or some other surface. 
The peptide test compounds are reacted with Zmaxl or HBM proteins and washed. Bound 
Zmaxl or HBM protein is then detected by methods well known in the art. Purified Zmaxl 
or HBM can be coated directly onto plates for use in the aforementioned drug screening 
techniques. Alternatively, non-neutralizing antibodies to the protein can be used as capture 
antibodies to immobilize the Zmaxl or HBM protein on the solid phase. 

This invention also contemplates the use of competitive drug screening assays in 
which neutralizing antibodies capable of specifically binding the Zmaxl or HBM protein 
compete with a test compound for binding to the Zmaxl or HBM protein or fragments 
thereof. In this manner, the antibodies can be used to detect the presence of any peptide that 
shares one or more antigenic determinants of the Zmaxl or HBM protein. 

A further technique for drug screening involves the use of host eukaryotic cell lines or 
cells (such as described above) that have a nonfunctional Zmaxl or HBM gene. These host 
cell lines or cells are defective at the Zmaxl or HBM protein level. The host cell lines or 
cells are grown in the presence of drug compound. The rate of growth of the host cells is 
measured to determine if the compound is capable of regulating the growth of Zmaxl or 
HBM defective cells. 
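
A common way to express a growth rate from two cell counts is the population doubling time, assuming exponential growth over the interval. The sketch below compares treated and untreated cultures on that basis; the counts, the 48-hour interval, and the function names are hypothetical and serve only to show the calculation.

```python
import math


def doubling_time(count_start, count_end, hours):
    """Doubling time in hours, assuming exponential growth between two counts."""
    if count_start <= 0 or count_end <= count_start or hours <= 0:
        raise ValueError("need positive counts with net growth over a positive interval")
    return hours * math.log(2) / math.log(count_end / count_start)


def slowdown(treated, untreated):
    """Ratio of doubling times (treated / untreated); above 1 means slower growth."""
    return doubling_time(*treated) / doubling_time(*untreated)


# Hypothetical cultures: (cells at start, cells at end, hours elapsed).
untreated = (1.0e5, 4.0e5, 48.0)  # two doublings in 48 h
treated = (1.0e5, 2.0e5, 48.0)    # one doubling in 48 h

print(round(doubling_time(*untreated), 1), "h untreated")
print(round(doubling_time(*treated), 1), "h treated")
print(round(slowdown(treated, untreated), 2), "fold slower with compound")
```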

The goal of rational drug design is to produce structural analogs of biologically active 
proteins of interest or of small molecules with which they interact (e.g., agonists, antagonists, 
inhibitors) in order to fashion drugs which are, for example, more active or stable forms of 
the protein, or which, e.g., enhance or interfere with the function of a protein in vivo. See, 
e.g., Hodgson, 1991 Bio/Technology, 9: 19-21. In one approach, one first determines the 
three-dimensional structure of a protein of interest (e.g., Zmaxl or HBM protein) or, for 
example, of the Zmaxl- or HBM-receptor or ligand complex, by x-ray crystallography, by 
computer modeling or most typically, by a combination of approaches. Less often, useful 
information regarding the structure of a protein may be gained by modeling based on the 
structure of homologous proteins. An example of rational drug design is the development of 
HIV protease inhibitors (Erickson et al, 1990 Science, 249: 527-33). In addition, peptides 
(e.g., Zmaxl or HBM protein) are analyzed by an alanine scan (Wells, 1991 Methods in 
Enzymol., 202: 390-411). In this technique, an amino acid residue is replaced by Ala, and its 
effect on the peptide's activity is determined. Each of the amino acid residues of the peptide 
is analyzed in this manner to determine the important regions of the peptide. 
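
The set of variants needed for an alanine scan can be enumerated mechanically: one variant per position, with that residue replaced by Ala. The following sketch shows the enumeration only (the activity measurement is experimental); the five-residue peptide is a made-up example and not a Zmaxl or HBM sequence.

```python
def alanine_scan(peptide):
    """Return (position, original_residue, variant) for every single Ala substitution.

    Positions are 1-based. Residues that are already alanine are skipped,
    since substituting them would reproduce the parent peptide.
    """
    variants = []
    for index, residue in enumerate(peptide):
        if residue == "A":
            continue
        variant = peptide[:index] + "A" + peptide[index + 1:]
        variants.append((index + 1, residue, variant))
    return variants


# Hypothetical peptide in one-letter code.
for position, residue, variant in alanine_scan("GAVLK"):
    print(f"{residue}{position}A  {variant}")
```

Each variant would then be synthesized or expressed and assayed, and positions whose substitution changes activity mark the important regions of the peptide.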

It is also possible to isolate a target-specific antibody, selected by a functional assay, 
and then to solve its crystal structure. In principle, this approach yields a pharmacore upon 
which subsequent drug design can be based. It is possible to bypass protein crystallography 
altogether by generating anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically 
active antibody. As a mirror image of a mirror image, the binding site of the anti-ids would 
be expected to be an analog of the original receptor. The anti-id could then be used to 
identify and isolate peptides from banks of chemically or biologically produced 
peptides. Selected peptides would then act as the pharmacore. 

Thus, one may design drugs which have, e.g., improved Zmaxl or HBM protein 
activity or stability or which act as inhibitors, agonists, antagonists, etc. of Zmaxl or HBM 
protein activity. By virtue of the availability of cloned Zmaxl or HBM sequences, sufficient 
amounts of the Zmaxl or HBM protein may be made available to perform such analytical 
studies as X-ray crystallography. In addition, the knowledge of the Zmaxl or HBM protein 
sequence provided herein will guide those employing computer modeling techniques in place 
of, or in addition to X-ray crystallography. 

Identified drug candidates (known as "leads") may be further studied by use of 
transgenic animals. The transgenic animals of the present invention are useful for creating an 
animal model of bone density modulation which may be used to test and refine drug leads. 
The transgenic animals described above represent a single example of the Zmaxl and HBM 
or HBM-like transgenic animals contemplated herein. One skilled in the art is aware of 
variations and the considerations which will be routinely applied in modifying the present 
invention to a specific purpose. Examples of the development of transgenic animal models 
are given, for example, in Strategies in Transgenic Animal Science (1995, Monastersky and 
Robl Eds., Washington, DC: American Society for Microbiology) and references therein, 
which are all incorporated herein by reference in their entirety. 

As an example, at least two groups of transgenic animals can be created as described, 
so that one group expresses HBM and another group expresses Zmaxl. These animals can be 
treated with the candidate drug for some time spanning from a few days to the remainder of 
the animal's life-span. The animals are monitored for changes in bone mass and/or surrogate 
markers for the HBM phenotype. The transgenic animals used in such a study may express 
human HBM protein and Zmaxl protein or the homologous HBM and Zmaxl proteins 
defined for each species or variants thereof. Expression may be driven by a ubiquitous 
promoter or a bone-specific promoter, as would be known to one skilled in the art. It will be 
informative to compare groups of animals utilizing different promoters. 

The transgenic animals of the method according to the present invention may also 
comprise knock-in (KI) and/or knock-out (KO) animals, such as mice, which express HBM, 
Zmaxl, or neither under the control of the animal's native promoter. Such animals may be 
created by homologous recombination in ES cells as described above (and elsewhere in the 
literature of the art such as, for example, U.S. Patent Nos. 6,187,991 and 6,187,992 and 
references cited therein, which are incorporated herein in their entirety). The experimental 
groups of transgenic animals treated with candidate drugs may be monitored by non-invasive 
means, by the monitoring of surrogate markers as described above, and/or by ex vivo analysis 
of bones from sacrificed animals at given time-points. 

Likewise, the effect of such treatments as dietary control (e.g., varying intake of 
vitamins, minerals, proteins, lipids, etc.), ovariectomy, direct administration of all or part of 
purified HBM or Zmaxl proteins, administration of antisense nucleotides, antibodies against 
Zmaxl or gene therapy in adults may be investigated by systematic administration of the 
treatment to transgenic animals according to the invention. Such treatments may include 
administration of estrogens, tamoxifen, raloxifene (or other selective estrogen receptor 
modulators, SERMs), vitamin D analogs, calcitonin, cathepsin K inhibitors, statins (e.g., 
simvastatin, pravastatin, and lovastatin), bisphosphonates, parathyroid hormone (PTH), bone 
morphogenetic proteins (BMP) as described in U.S. Patent Nos. 6,190,880 and 5,866,364, 
and combinations of the above compounds. 

In view of the homology between Zmaxl and LRP6 and to the LDL receptor and the 
further observations of markers for cardiac health being modulated in HBM subjects, it is an 
aspect of the present invention to use the novel research methods disclosed herein to screen 
known cardioprotective treatments for bone modulating effects. Thereby, the present 
invention provides therapeutic methods which are both cardioprotective and which improve 
bone quality. The models are useful for testing drugs and researching lipid modulation 
effects related to Zmaxl, LRP6, HBM and HBM-like proteins and nucleic acids. 

The effect of various mutations of Zmaxl, HBM and HBM-like genes may be 
investigated by creation of additional lines of transgenic animals according to the invention, 
wherein these animals comprise such mutations. By comparison of direct measures of bone 
development or surrogate markers, an embodiment of the invention provides a useful research 
tool for screening gene therapy reagents and candidate drug therapies, and for elucidating molecular 
mechanisms of bone development modulation. One skilled in the art knows how to use the 
methods of the present invention to achieve these goals. 

The present invention provides a method and useful research tools, such as transgenic 
knock-out mice, for testing prospective gene therapies. A transgenic knock-out animal such as 
a mouse, which does not express endogenous Zmaxl or HBM, is created as described 
above. A prospective gene therapy, 
such as intravenous injection of a recombinant replication-defective adenovirus encoding the 
human HBM protein driven by the CMV β-actin promoter, is administered. Parameters of 
bone density and/or surrogate markers are monitored over time following therapy (Ishibashi 
et al., 1993 J. Clin. Invest. 92: 883-93). A TaqMan® primer set such as that described above 
may be used to measure expression of transgenic HBM. One skilled in the art knows 
alternative methods such as the Northern blot method. Comparison of treated and untreated 
animals both within and between groups of germ-line transgenic animals, knock-out (null 
allele) background, and wild-type endogenous Zmaxl background animals provides 
complementary controls for assessing the relative effectiveness of various modalities of gene 
therapy. 

Uses for the transgenic animal models contemplated herein also include, but are not 
limited to: (1) sources for generating bone cell cultures from the calvaria of the transgenic 
animals to study bone cell (e.g., osteoblast and osteoclast) function and number; (2) models 
for studying the effects of estrogen loss by ovariectomizing (ovx) the transgenic animals; (3) 
models for testing mechanical loading on the bones and other stress/strength tests; (4) 
breeding models with which to breed to other genetically modified or naturally occurring 
mutant animals that display bone abnormalities (Chipman et al., 1993 PNAS, 90: 1701-05; 
Phillips et al., 2000 Bone 27: 219-26; Kajkenova et al., 1997 J. Bone Min. Res. 12: 1772-79; 
Jilka et al., 1996 J. Clin. Invest. 97: 1732-40; Takahashi et al., 1994 Bone and Mineral 24: 
245-55); (5) bone mis/disuse models to test the effects of weight bearing or gravity; (6) 
models for identifying and screening reagents which may or are known to modulate bone 
metabolism (e.g., PTH, estrogen, vitamin D analogs, bisphosphonates, statins, leptin, BMP, 
apoE); and (7) models for investigating prospective treatments to improve fracture repair. 

The transgenic animal models can be analyzed using methods including, but not limited to, 
bone densitometry by pDEXA, pQCT and microCT; histology, molecular marker analysis, 
apoptosis, cell proliferation, cell cycle, mineralization, serum biochemistry, transcriptional 
profiling, and the like. 

22. Methods of Use: Avian and Mammalian Animal Husbandry 

The Zmaxl (LRP5) DNA and Zmaxl (LRP5) protein and/or the HBM DNA and 
HBM protein (or LRP6 or an HBM-like nucleic acid or protein as also contemplated herein) 
can be used as vertebrate and preferably human therapeutic agents and as avian and 
mammalian veterinary agents, including for livestock breeding. Birds, including, for 
example, chickens, roosters, hens, turkeys, ostriches, ducks, pheasants and quails, can benefit 
from the identification of the gene and pathway for high bone mass. In many examples cited 
in the literature (for example, McCoy et al., 1996 Res. Vet. Sci., 60: 185-6), weakened bones due 
to husbandry conditions cause cage layer fatigue, osteoporosis and high mortality rates. 
Additional therapeutic agents to treat osteoporosis or other bone disorders in birds can have 
considerable beneficial effects on avian welfare and the economic conditions of the livestock 
industry, including, for example, meat and egg production. 

23. Methods of Use: Diagnostic Assays Using Zmaxl-Specific Oligonucleotides for 
Detection of Genetic Alterations Affecting Bone Development 

In cases where an alteration or disease of bone development is suspected to involve an 
alteration of the Zmaxl, LRP6, HBM or HBM-like gene, specific oligonucleotides may be 
constructed and used to assess the level of Zmaxl mRNA or HBM mRNA, respectively, in 
bone tissue or in another tissue that affects bone development. 

For example, to test whether a person has the HBM gene, which affects bone density, 
polymerase chain reaction can be used. Two oligonucleotides are synthesized by standard 
methods or are obtained from a commercial supplier of custom-made oligonucleotides. The 
length and base composition are determined by standard criteria using the Oligo 4.0 primer 
Picking program (Wojciech Rychlik, 1992) or any suitable alternative. One of the 
oligonucleotides is designed so that it will hybridize only to HBM DNA under the PCR 
conditions used. The other oligonucleotide is designed to hybridize to a segment of Zmaxl 
genomic DNA such that amplification of DNA using these oligonucleotide primers produces 
a conveniently identified DNA fragment. For example, the pair of primers 
CCAAGTTCTGAGAAGTCC (SEQ ID NO:32) and AATACCTGAAACCATACCTG (SEQ 
ID NO:33) will amplify a 530 base pair DNA fragment from a DNA sample when the 
following conditions are used: step 1 at 95°C for 120 seconds; step 2 at 95°C for 30 seconds; 
step 3 at 58°C for 30 seconds; step 4 at 72°C for 120 seconds; where steps 2-4 are repeated 
35 times. Tissue samples may be obtained from hair follicles, whole blood, or the buccal 
cavity. 
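
As a quick sanity check on the primer pair and cycling program given above, the sketch below computes each primer's GC fraction, a rough melting temperature, and the total programmed hold time. The Wallace rule (2 °C per A/T, 4 °C per G/C) is a rule of thumb chosen here for illustration; it is an assumption of this example and is not the method used by the Oligo program. Ramp times between steps are ignored.

```python
PRIMERS = {
    "SEQ ID NO:32": "CCAAGTTCTGAGAAGTCC",
    "SEQ ID NO:33": "AATACCTGAAACCATACCTG",
}


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Approximate melting temperature in deg C by the Wallace rule (short oligos only)."""
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def program_seconds(initial=120, cycle_steps=(30, 30, 120), cycles=35):
    """Total hold time: step 1 once, then steps 2-4 repeated for the given cycles."""
    return initial + cycles * sum(cycle_steps)


for name, seq in PRIMERS.items():
    print(name, len(seq), "nt", f"GC={gc_fraction(seq):.2f}", f"Tm~{wallace_tm(seq)} C")

total = program_seconds()
print(f"programmed hold time: {total} s ({total / 60:.0f} min)")
```

Both estimates fall a few degrees below the 58 °C annealing step, which is unsurprising given how coarse the Wallace rule is; a nearest-neighbor calculation would be the usual next step.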

The fragment generated by the above procedure is sequenced by standard techniques. 
Individuals heterozygous for the HBM gene will show an equal amount of G and T at the 
second position in the codon for glycine 171. Normal or homozygous wild-type individuals 
will show only G at this position. Similar routine procedures may be used to develop assays 
for other polymorphisms and variants according to the invention. 
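
The read-out rule for the second position of codon 171 can be written as a small classifier over the G and T signal at that base. This is a sketch under stated assumptions: the peak heights are hypothetical, the 25% minor-allele cutoff is an arbitrary illustrative threshold, and the homozygous HBM/HBM branch is added for completeness even though the text above describes only the heterozygous and wild-type cases.

```python
def call_genotype(g_peak, t_peak, min_minor_fraction=0.25):
    """Classify a sample from G and T peak heights at codon 171, position 2.

    Returns "+/+" when essentially only G is seen, "HBM/+" when G and T are
    both present in comparable amounts, and "HBM/HBM" when essentially only
    T is seen. The cutoff is an assumed value, not a validated threshold.
    """
    total = g_peak + t_peak
    if g_peak < 0 or t_peak < 0 or total <= 0:
        raise ValueError("peak heights must be non-negative with a positive sum")
    t_fraction = t_peak / total
    if t_fraction < min_minor_fraction:
        return "+/+"
    if t_fraction > 1.0 - min_minor_fraction:
        return "HBM/HBM"
    return "HBM/+"


# Hypothetical peak heights (arbitrary units) from a sequencing trace.
print(call_genotype(1000, 20))   # almost pure G
print(call_genotype(510, 490))   # roughly equal G and T
```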

Other amplification techniques besides PCR may be used as alternatives, such as 
ligation-mediated PCR or techniques involving Q-beta replicase (Cahill et al., Clin. Chem., 
37(9):1482-5 (1991)). For example, the oligonucleotides 
5'-AGCTGCTCGTAGCTGTCTCTCCCTGGATCACGGGTACATGTACTGGACAGACTGGGT-3' (SEQ ID NO:34) 
and T5-GAGACGCCCCGGATTGAGCGGGCAGGGATAGCTTATTCCCTGTGCCGCA 
TTACGGC-3' (SEQ ID NO:35) can be hybridized to a denatured human DNA sample, 
treated with a DNA ligase, and then subjected to PCR amplification using the primer 
oligonucleotides 5'-AGCTGCTCGTAGCTGTCTCTCCCTGGA-3' (SEQ ID NO:36) and 
5'-GCCGTAATGCGGCACAGGGAATAAGCT-3' (SEQ ID NO:37). In the first two 
oligonucleotides, the outer 27 bases are random sequence corresponding to primer binding 
sites, and the inner 30 bases correspond to sequences in the Zmaxl gene. The T at the end of 
the first oligonucleotide corresponds to the HBM gene. The first two oligonucleotides are 
ligated only when hybridized to human DNA carrying the HBM gene, which results in the 
formation of an amplifiable 114 bp DNA fragment. 

Products of amplification can be detected by agarose gel electrophoresis, quantitative 
hybridization, or equivalent techniques for nucleic acid detection known to one skilled in the 
art of molecular biology (Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, NY (1989)). 

Other alterations in the Zmaxl gene or the HBM gene (or LRP6 or HBM-like nucleic 
acids, which are also contemplated herein) may be diagnosed by the same type of 
amplification-detection procedures, by using oligonucleotides designed to identify those 
alterations. These procedures can be used in animals as well as humans to identify alterations 
in Zmaxl or HBM that affect bone development. 

Expression of Zmaxl or HBM in bone tissue may be accomplished by fusing the 
cDNA of Zmaxl or HBM, respectively, to a bone-specific promoter in the context of a vector 
for genetically engineering vertebrate cells. DNA constructs are introduced into cells by 
packaging the DNA into virus capsids, by the use of cationic liposomes, electroporation, or 
by calcium phosphate transfection. Transfected cells, preferably osteoblasts, may be studied 
in culture or may be introduced into bone tissue in animals by direct injection into bone or by 
intravenous injection of osteoblasts, followed by incorporation into bone tissue (Ko et al., 
1996 Cancer Res, 56: 4614-9). For example, the osteocalcin promoter, which is specifically 
active in osteoblasts, may be used to direct transcription of the Zmaxl gene or the HBM gene. 
Any of several vectors and transfection methods may be used, such as retroviral vectors, 
adenovirus vectors, or vectors that are maintained after transfection using cationic liposomes, 
or other methods and vectors described herein. 

Alteration of the level of functional Zmaxl protein or HBM protein affects the level 
of bone mineralization. By manipulating levels of functional Zmaxl protein or HBM protein, 
it is possible to affect bone development and to increase or decrease levels of bone 
mineralization. For example, it may be useful to increase bone mineralization in patients 
with osteoporosis. Alternatively, it may be useful to decrease bone mineralization in patients 
with osteopetrosis or Paget's disease. Alteration of Zmaxl levels or HBM levels can also be 
used as a research tool. Specifically, it is possible to identify proteins, mRNA and other 
molecules whose level or modification status is altered in response to changes in functional 
levels of Zmaxl or HBM. The pathology and pathogenesis of bone disorders are known and 
described, for example, in Rubin and Farber (Eds.), Pathology, 2nd Ed., J.B. Lippincott Co., 
Philadelphia, PA (1994). 

A variety of techniques can be used to alter the levels of functional Zmaxl or HBM. 
For example, intravenous or intraosseous injection of the extracellular portion of Zmaxl or 
mutations thereof, or HBM or mutations thereof, will alter the level of Zmaxl activity or 
HBM activity, respectively, in the body of the treated human, animal or bird. Truncated 
versions of the Zmaxl protein or HBM protein can also be injected to alter the levels of 
functional Zmaxl protein or HBM protein, respectively. Certain forms of Zmaxl or HBM 
enhance the activity of endogenous protein, while other forms are inhibitory. 

In a preferred embodiment, the HBM protein is used to treat osteoporosis, fracture, or 
other bone disorder. In a further preferred embodiment, the extracellular portion of the HBM 
protein or a fragment thereof (e.g., the Dkk binding domain) is used. This HBM protein may be 
optionally modified by the addition of a moiety that causes the protein to adhere to the 
surface of cells. The protein is prepared in a pharmaceutically acceptable solution and is 
administered by injection or another method that achieves acceptable pharmacokinetics and 
distribution. 

In a second embodiment of this method, Zmaxl, HBM, HBM variant, and/or LRP6 
levels are increased or decreased by gene therapy techniques. To increase Zmaxl or HBM 
levels, osteoblasts or another useful cell type are genetically engineered to express high levels 
of Zmaxl or HBM as described above. Alternatively, to decrease Zmaxl or HBM levels, 
antisense constructs that specifically reduce the level of translatable Zmaxl or HBM mRNA 
can be used. In general, a tissue-nonspecific promoter may be used, such as the CMV 
promoter or another commercially available promoter found in expression vectors (Wu et al., 
1996 Toxicol. Appl. Pharmacol. 141: 330-9). In a preferred embodiment, a Zmaxl cDNA or 
its antisense is transcribed by a bone-specific promoter, such as the osteocalcin or another 
promoter, to achieve specific expression in bone tissue. In this way, if a Zmaxl-expressing 
DNA construct or HBM-expressing construct is introduced into non-bone tissue, it will not be 
expressed. 

In a third embodiment of this method, antibodies against Zmaxl, LRP6, HBM-like or 
HBM are used to modulate their function. Such antibodies are identified herein. 

In a fourth embodiment of this method, drugs that are agonists or antagonists of 
Zmaxl function or HBM function are used. Such drugs are described herein and optimized 
according to techniques of medicinal chemistry well known to one skilled in the art of 
pharmaceutical development. 

Zmaxl and HBM interact with several proteins, such as ApoE. Molecules that inhibit 
the interaction between Zmaxl or HBM and ApoE or another binding partner are expected to 
alter bone development and mineralization. Such inhibitors may be useful as drugs in the 
treatment of osteoporosis, osteopetrosis, or other diseases of bone mineralization. Such 
inhibitors may be low molecular weight compounds, proteins or other types of molecules. 
See Kim et al., 1998 J. Biochem. (Tokyo) 124: 1072-1076. 

Inhibitors of the interaction between Zmaxl or HBM and interacting proteins may be 
isolated by standard drug-screening techniques. For example, Zmaxl protein (or a fragment 
thereof) or HBM protein (or a fragment thereof) can be immobilized on a solid support such 
as the base of a microtiter well. A second protein or protein fragment, such as ApoE, is 
derivatized to aid in detection, for example with fluorescein, iodine, or biotin, then added to 
the Zmax1 or HBM in the presence of candidate compounds that may specifically inhibit this
protein-protein interaction. Such a screen may use a domain of Zmax1 or HBM, respectively,
and thus avoid problems associated with its transmembrane segment. Drug screens of this
type are well known to one skilled in
the art of pharmaceutical development. 
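
By way of illustration only, the readout of a plate-based binding screen of this kind could be reduced to a percent-inhibition value for each candidate compound. The Python sketch below assumes a fluorescence reading per well, a no-compound well giving full binding, and a blank well giving background signal. The function names, the 50% hit threshold, and the readings are hypothetical and are not taken from the present disclosure.

```python
def percent_inhibition(signal, max_signal, background):
    """Percent inhibition of labeled-partner binding in one well.

    signal:      reading with the candidate compound present
    max_signal:  reading with no compound (full binding)
    background:  reading with no immobilized protein (non-specific signal)
    """
    window = max_signal - background
    if window <= 0:
        raise ValueError("max_signal must exceed background")
    return 100.0 * (max_signal - signal) / window


def pick_hits(readings, max_signal, background, threshold=50.0):
    """Return the names of compounds whose inhibition meets the assumed threshold."""
    return sorted(
        name
        for name, signal in readings.items()
        if percent_inhibition(signal, max_signal, background) >= threshold
    )


# Hypothetical fluorescence readings (arbitrary units) for three compounds.
readings = {"cmpd_A": 900.0, "cmpd_B": 350.0, "cmpd_C": 120.0}
hits = pick_hits(readings, max_signal=1000.0, background=100.0)
print(hits)
```

Subtracting the background from the signal window keeps non-specific binding of the labeled partner from being counted as residual interaction.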

Because Zmax1 and HBM are involved in bone development, proteins that bind to
Zmax1 and HBM are also expected to be involved in bone development. Such binding
proteins can be identified by standard methods, such as co-immunoprecipitation,
co-fractionation, or the two-hybrid screen (Ausubel et al., Current Protocols in Molecular
Biology, John Wiley & Sons (1997)). For example, to identify Zmax1-interacting proteins or
HBM-interacting proteins using the two-hybrid system, the extracellular domain of Zmax1 or
HBM is fused to LexA in the yeast vector pEG202 (the "bait") and expressed
in the yeast strain EGY48. The yeast strain is transformed with a "prey" library in the
appropriate vector, which encodes a galactose-inducible transcription-activation sequence
fused to candidate interacting proteins. The techniques for initially selecting and
subsequently verifying interacting proteins by this method are well known to one skilled in
the art of molecular biology (Ausubel et al., 1997).

In a preferred embodiment, proteins that interact with HBM, but not Zmax1, are
identified using a variation of the above procedure (Xu et al., 1997 Proc. Natl. Acad. Sci.
USA, 94: 12473-8). This variation of the two-hybrid system uses two baits: Zmax1 and
HBM are fused to LexA and TetR, respectively. Alternatively, proteins that interact
with HBM but not Zmax1 are also isolated. These procedures are well known to one
skilled in the art of molecular biology, and are a simple variation of standard two-hybrid
procedures.

As an alternative method of isolating substances interacting with Zmax1 or HBM, a
biochemical approach is used. The Zmax1 protein or a fragment thereof, such as the
extracellular domain, or the HBM protein or a fragment thereof, such as the extracellular
domain, is chemically coupled to Sepharose beads. The Zmax1- or HBM-coupled beads are
poured into a column. A biological extract, such as a lipid fraction, serum proteins, proteins
in the supernatant of a bone biopsy, or cellular contents from gently lysed osteoblast cells, is
added to the column. Non-specifically bound compounds are eluted, the column is washed
several times with a low-salt buffer, and then tightly binding compounds may be eluted with
a high-salt buffer. These are candidate compounds that bind to Zmax1 or HBM, and can be
tested for specific binding by standard tests and control experiments. Sepharose beads used
for coupling proteins and the methods for performing the coupling are commercially
available (Sigma), and the procedures described here are well known to one skilled in the art
of protein biochemistry.

As a variation of the above procedure, proteins that are eluted by high salt from the
Zmax1- or HBM-Sepharose column are then added to an HBM-Zmax1-Sepharose column.
Proteins that flow through without sticking are proteins that bind to Zmax1 but not to HBM.
Alternatively, proteins that bind to the HBM protein and not to the Zmax1 protein can be
isolated by reversing the order in which the columns are used.

Isolated compounds may be identified by standard methods such as 2D gel
electrophoresis, chromatography, and mass spectroscopy.

24. Method of Use: Transformation-Associated Recombination (TAR) Cloning

Essential for the identification of novel allelic variants of Zmax1 (LRP5) and LRP6 is
the ability to examine the sequence of both copies of the gene in an individual. To
accomplish this, two "hooks," or regions of significant similarity, are identified within the
genomic sequence such that they flank the portion of DNA that is to be cloned. Most
preferably, the first of these hooks is derived from sequences 5' to the first exon of interest
and the second is derived from sequences 3' to the last exon of interest. These two "hooks"
are cloned into a bacterial/yeast shuttle vector such as that described by Larionov et al., 1997
Proc. Natl. Acad. Sci. USA, 94: 7384-7. Other similar vector systems may also be used. To
recover the entire genomic copy of the Zmax1 gene, the plasmid containing the two "hooks"
is linearized with a restriction endonuclease or is produced by another method such as PCR.
This linear DNA fragment is introduced into yeast cells along with human genomic DNA.
Typically, the yeast Saccharomyces cerevisiae is used as a host cell, although Kouprina et al.,
1998 Genome Res. 8: 666-72 have reported using chicken host cells as well. During and after
the process of transformation, the endogenous host cell converts the linear plasmid to a circle
by a recombination event whereby the region of the human genomic DNA homologous to the
"hooks" is inserted into the plasmid. This plasmid can be recovered and analyzed by methods
well known to one skilled in the art. Obviously, the specificity for this reaction requires the
host cell machinery to recognize sequences similar to the "hooks" present in the linear
fragment. However, 100% sequence identity is not required, as shown by Kouprina et al.,
1998 Genomics 53: 21-8, where the author describes using degenerate repeated sequences
common in the human genome to recover fragments of human DNA from a rodent/human
hybrid cell line.

In another example, only one "hook" is required, as described by Larionov et al., 1998
Proc. Natl. Acad. Sci. USA, 95: 4469-74. For this type of experiment, termed "radial TAR
cloning," the other region of sequence similarity to drive the recombination is derived from a
repeated sequence from the genome. In this way, regions of DNA adjacent to the Zmax1
gene coding region can be recovered and examined for alterations that may affect function.
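
As a minimal sketch of how two flanking "hooks" might be picked from a known genomic sequence, the following Python example takes the sequence immediately 5' to the first exon of interest and immediately 3' to the last exon of interest. The function name, the hook length, the coordinate convention (0-based, end-exclusive), and the toy sequence are all assumptions made for illustration. In practice hook choice would also have to consider uniqueness and repeat content.

```python
def tar_hooks(genomic, first_exon_start, last_exon_end, hook_len=60):
    """Return (5' hook, 3' hook) flanking the region to be cloned.

    Coordinates are 0-based and end-exclusive: the region of interest is
    genomic[first_exon_start:last_exon_end].
    """
    if first_exon_start - hook_len < 0 or last_exon_end + hook_len > len(genomic):
        raise ValueError("not enough flanking sequence for hooks of this length")
    five_prime = genomic[first_exon_start - hook_len:first_exon_start]
    three_prime = genomic[last_exon_end:last_exon_end + hook_len]
    return five_prime, three_prime


# Toy sequence: 10 nt upstream, a 15 nt "gene", 10 nt downstream.
upstream, gene, downstream = "TTGACCATGA", "ATGGCCGTTAACTAG", "CCGGTTAAGC"
genomic = upstream + gene + downstream
hook5, hook3 = tar_hooks(genomic, first_exon_start=10, last_exon_end=25, hook_len=6)
print(hook5, hook3)
```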

25. Methods of Use: Genomic Screening 

The use of polymorphic genetic markers linked to the HBM gene, or to Zmax1
(LRP5) or to LRP6 is very useful in predicting susceptibility to osteoporosis or other bone
diseases. Koller et al., 1998 Amer. J. Bone Min. Res. 13: 1903-8 have demonstrated that the
use of polymorphic genetic markers is useful for linkage analysis. Similarly, the
identification of polymorphic genetic markers within the high bone mass gene will allow the
identification of specific allelic variants that are in linkage disequilibrium with other genetic
lesions that affect bone development. Using the DNA sequence from the BACs, a
dinucleotide (CA)n repeat was identified and two unique PCR primers that will amplify the
genomic DNA containing this repeat were designed, as shown below:

B200E21C16_L: GAGAGGCTATATCCCTGGGC (SEQ ID NO: 38)

B200E21C16_R: ACAGCACGTGTTTAAAGGGG (SEQ ID NO: 39)

and used in the genetic mapping study. 
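
The way such a primer pair brackets a dinucleotide repeat can be sketched in Python as follows. The two primer sequences are those given above. The template is hypothetical, built only so that the primers have somewhere to anneal, and the helper names and the repeat count of 18 are likewise assumptions. The sketch finds the top-strand product bounded by the primers and counts the CA units in the longest uninterrupted run.

```python
import re

FWD = "GAGAGGCTATATCCCTGGGC"  # B200E21C16_L (SEQ ID NO: 38)
REV = "ACAGCACGTGTTTAAAGGGG"  # B200E21C16_R (SEQ ID NO: 39)

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def amplicon(template, fwd, rev):
    """Return the top-strand product bounded by the two primers, or None."""
    start = template.find(fwd)
    if start == -1:
        return None
    rev_site = reverse_complement(rev)
    end = template.find(rev_site, start + len(fwd))
    if end == -1:
        return None
    return template[start:end + len(rev_site)]


def longest_ca_repeat(seq):
    """Number of CA units in the longest uninterrupted (CA)n run."""
    runs = re.findall(r"(?:CA)+", seq)
    return max((len(run) // 2 for run in runs), default=0)


# Hypothetical template: primer sites around an 18-unit CA repeat.
template = ("TTTT" + FWD + "GGATCC" + "CA" * 18 + "GAATTC"
            + reverse_complement(REV) + "AAAA")
product = amplicon(template, FWD, REV)
print(len(product), longest_ca_repeat(product))
```

Alleles that differ by one CA unit would give products that differ in length by 2 bp, which is what makes the marker informative for mapping.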

This method has been used successfully by others skilled in the art (e.g., Sheffield et
al., 1995 Genet., 4: 1837-44; LeBlanc-Straceski et al., 1994 Genomics, 19: 341-9; Chen et al.,
1995 Genomics, 25: 1-8). Use of these reagents with populations or individuals will predict
their risk for osteoporosis. Similarly, single nucleotide polymorphisms (SNPs), such as those
shown in Table 4 above, can be used as well to predict risk for developing bone diseases or
resistance to osteoporosis in the case of the HBM gene.

26. Methods of Use: Modulators of Tissue Calcification

The calcification of tissues in the human body is well documented. Towler et al.
(1998 J. Biol. Chem. 273: 30427-34) demonstrated that several proteins known to regulate
calcification of the developing skull in a model system are expressed in calcified aorta. The
expression of Msx2, a gene transcribed in osteoprogenitor cells, in calcified vascular tissue
indicates that genes which are important in bone development are involved in calcification of
other tissues.

Treatment with HBM protein, HBM-like proteins and polypeptides, agonists of LRP5
and LRP6 and HBM or antagonists of Dkk-1 is likely to ameliorate calcification (such as of
the vasculature, dentin and bone of the skull viscera) due to their demonstrated effect on bone
mineral density. In experimental systems where tissue calcification is demonstrated, the
over-expression or repression of Zmax1 (LRP5) activity permits the identification of
molecules that are directly regulated by the Zmax1 gene. These genes are potential targets for
therapeutics aimed at modulating tissue calcification. For example, an animal, such as the
LDLR -/- mouse, is fed a high fat diet and is observed to demonstrate expression of markers
of tissue calcification, including Zmax1. These animals are then treated with antibodies to
Zmax1 or HBM protein, antisense oligonucleotides directed against Zmax1 or HBM cDNA,
or with compounds known to bind the Zmax1 or HBM protein or its binding partner or
ligand. RNA or proteins are extracted from the vascular tissue and the relative expression
levels of the genes expressed in the tissue are determined by methods well known in the art.
Genes that are regulated in the tissue are potential therapeutic targets for pharmaceutical
development as modulators of tissue calcification.

The nucleic acids, proteins, peptides, amino acids, small molecules or other 
pharmaceutically useful compounds of the present invention that are to be given to an
individual may be administered in the form of a composition with a pharmaceutically 
acceptable carrier, excipient or diluent, which are well known in the art. The individual may 
be a mammal or a bird, preferably a human, a rat, a mouse or bird. Such compositions may 
be administered to an individual in a pharmaceutically effective amount. The amount 
administered will vary depending on the condition being treated and the patient being treated. 
The compositions may be administered alone or in combination with other treatments. 

27. Methods to Identify Agents that Modulate the Expression of a Nucleic Acid
Encoding the Dkk and/or LRP5 Proteins and/or Dkk Interacting Proteins

Another embodiment of the present invention provides methods for identifying agents 
that modulate the expression of a nucleic acid encoding Dkk, which is part of the Wnt
pathway and interacts with LRP5, LRP6 and, to a much lesser extent, HBM and its
variants. Such assays may utilize any available means of monitoring for changes in the
expression level of the nucleic acids of the invention. As used herein, an agent is said to
modulate the expression of Dkk if it is capable of up- or down-regulating expression of the
nucleic acid in a cell (e.g., mRNA).

In one assay format, cell lines that contain reporter gene fusions between the nucleic
acid encoding Dkk (or proteins which modulate the activity of Dkk) and any assayable fusion
partner may be prepared. Numerous assayable fusion partners are known and readily
available, including but not limited to the firefly luciferase gene and the gene encoding
chloramphenicol acetyltransferase (Alam et al., 1990 Anal. Biochem. 188: 245-54). Cell lines
containing the reporter gene fusions are then exposed to the agent to be tested under
appropriate conditions and time. Differential expression of the reporter gene between
samples exposed to the agent and control samples identifies agents which modulate the
expression of a nucleic acid encoding Dkk or other protein which modulates Dkk activity.
Such assays can similarly be used to determine whether LRP5 and even LRP6 activity is
modulated by regulating Dkk activity. This can also be performed with the HBM variants.
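
The comparison of reporter expression between exposed and control samples can be illustrated with a short Python sketch. It assumes replicate luciferase readings per condition and a symmetric two-fold cutoff. The function names, the cutoff, and the readings are hypothetical and serve only to show the arithmetic.

```python
from statistics import mean


def fold_change(treated, control):
    """Mean treated reporter signal divided by mean control signal."""
    baseline = mean(control)
    if baseline <= 0:
        raise ValueError("control signal must be positive")
    return mean(treated) / baseline


def classify(fc, cutoff=2.0):
    """Label an agent using a symmetric (assumed) fold-change cutoff."""
    if fc >= cutoff:
        return "up-regulates"
    if fc <= 1.0 / cutoff:
        return "down-regulates"
    return "no clear effect"


# Hypothetical luciferase readings (relative light units), three replicates each.
control = [1000.0, 1100.0, 900.0]
agent_x = [2900.0, 3100.0, 3000.0]
fc = fold_change(agent_x, control)
print(fc, classify(fc))
```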

Additional assay formats may be used to monitor the ability of the agent(s) to
modulate the expression of a nucleic acid encoding Dkk alone, or Dkk and LRP5, and/or Dkk
interacting proteins such as those identified in Figure 31. For instance, mRNA expression
may be monitored directly by hybridization to the nucleic acids of the invention. Cell lines
are exposed to the agent to be tested under appropriate conditions and time and total RNA or
mRNA is isolated by standard procedures such as those disclosed in Sambrook et al. (1989);
Ausubel et al., Current Protocols in Molecular Biology (Greene Publishing Co., NY, 1995);
Maniatis et al., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory,
Cold Spring Harbor, NY, 1982); and Short Protocols in Molecular Biology: A Compendium
of Methods from Current Protocols in Molecular Biology (Frederick M. Ausubel et al., April
1999).

Probes to detect differences in RNA expression levels between cells exposed to the 
agent and control cells may be prepared from the nucleic acids of the invention. It is
preferable, but not necessary, to design probes which hybridize only with target nucleic acids
under conditions of high stringency. Only highly complementary nucleic acid hybrids form
under conditions of high stringency. Accordingly, the stringency of the assay conditions
determines the amount of complementarity which should exist between two nucleic acid
strands in order to form a hybrid. Stringency should be chosen to maximize the difference in
stability between the probe:target hybrid and potential probe:non-target hybrids.

Probes may be designed from the nucleic acids of the invention through methods 
known in the art. For instance, the G+C content of the probe and the probe length can affect 
probe binding to its target sequence. Methods to optimize probe specificity are commonly 
available. See, for example, Sambrook et al. (1989) or Ausubel et al. (Current Protocols in
Molecular Biology, Greene Publishing Co., NY, 1995).
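
To illustrate how G+C content and length bear on probe binding, the Python sketch below computes the G+C percentage of an oligonucleotide and a rough melting temperature using the Wallace rule, Tm = 2(A+T) + 4(G+C) in degrees C, which is only a coarse estimate intended for short oligonucleotides. The function names are hypothetical, and the example sequence is the 20-mer of SEQ ID NO: 38, used here purely as sample input.

```python
def gc_content(seq):
    """Percentage of G and C bases in an oligonucleotide."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def wallace_tm(seq):
    """Rough melting temperature in degrees C: 2*(A+T) + 4*(G+C)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


probe = "GAGAGGCTATATCCCTGGGC"  # sample input only (SEQ ID NO: 38)
print(gc_content(probe), wallace_tm(probe))
```

Longer probes and hybridizations in defined salt or formamide conditions generally call for more detailed formulas than this one.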

Hybridization conditions are modified using known methods, such as those described 
by Sambrook et al. (1989) and Ausubel et al. (1995), as suitable for each probe.
Hybridization of total cellular RNA or RNA enriched for polyA RNA can be accomplished in 
any available format. For instance, total cellular RNA or RNA enriched for polyA RNA can 
be affixed to a solid support and the solid support exposed to at least one probe comprising at 
least one, or part of one of the nucleic acid sequences of the invention under conditions in 
which the probe will specifically hybridize. Alternatively, nucleic acid fragments comprising 
at least one, or part of one of the sequences of the invention can be affixed to a solid support, 
such as a porous glass wafer. The glass or silica wafer can then be exposed to total cellular 
RNA or polyA RNA from a sample under conditions in which the affixed sequences will 
specifically hybridize. Such glass wafers and hybridization methods are widely available, for 
example, those disclosed by Beattie (WO 95/11755). By examining for the ability of a given
probe to specifically hybridize to an RNA sample from an untreated cell population and from 
a cell population exposed to the agent, agents which up- or down-regulate the expression of a 
nucleic acid encoding Dkk, a Dkk interacting protein, and/or LRP5 can be identified. 

Microarray technology and transcriptional profiling are examples of methods which 
can be used to analyze the impact of putative Dkk or Dkk interacting protein modulating 
compounds. For transcriptional profiling, mRNA from cells exposed in vivo to a potential
Dkk modulating agent, such as the Dkk interacting proteins identified in the present invention
(e.g., those identified in Fig. 31), agents which modulate Dkk interacting proteins, and
mRNA from the same type of cells that were not exposed to the agent could be reverse
transcribed and hybridized to a chip containing DNA from numerous genes, to thereby
compare the expression of genes in cells treated and not treated with the agent. If, for
example, a putative Dkk modulating agent down-regulates the expression of Dkk in the cells,
then use of the agent may be undesirable in certain patient populations. For additional
methods of transcriptional profiling and the use of microarrays, refer to, for example, U.S.
Patent No. 6,124,120 issued to Lizardi (2000).
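
A comparison of treated and untreated expression profiles is commonly expressed as a per-gene log2 ratio. The Python sketch below is one simple way to do this and is not the method of the cited patent. It assumes one intensity value per gene per condition, a pseudocount to avoid division by zero, and a cutoff of one log2 unit. The gene labels other than Dkk1 and all intensity values are hypothetical.

```python
import math


def log2_ratios(treated, untreated, pseudocount=1.0):
    """Per-gene log2(treated / untreated) for genes present in both profiles."""
    return {
        gene: math.log2((treated[gene] + pseudocount) / (untreated[gene] + pseudocount))
        for gene in treated
        if gene in untreated
    }


def changed_genes(ratios, min_abs_log2=1.0):
    """Keep genes whose expression changed by at least the assumed cutoff."""
    return {gene: r for gene, r in ratios.items() if abs(r) >= min_abs_log2}


# Hypothetical chip intensities for cells with and without the agent.
treated = {"Dkk1": 24.0, "geneA": 99.0, "geneB": 399.0}
untreated = {"Dkk1": 99.0, "geneA": 99.0, "geneB": 99.0}
changed = changed_genes(log2_ratios(treated, untreated))
print(changed)
```

A negative value for Dkk1, as in this toy data, would correspond to the down-regulation case discussed above.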

Additional methods for screening the impact of Dkk and Dkk interacting protein
modulating compounds or the impact of Dkk or Dkk interacting proteins on modulation of
LRP5, LRP6, HBM, HBM variants or the Wnt pathway include the use of TaqMan® PCR,
conventional reverse transcriptase PCR (RT-PCR), changes in downstream surrogate markers
(i.e., Wnt responsive genes), and anti-Dkk Western blots for protein detection. Other
methods would be readily apparent to the artisan of ordinary skill.

28. Methods to Identify Agents that Modulate at Least One Activity of Dkk, a Dkk
Interacting Protein, or LRP5/LRP6/HBM/HBM-like

Another embodiment of the present invention provides methods for identifying agents
that modulate at least one activity of Dkk, Dkk interacting proteins, and/or
LRP5/LRP6/HBM/HBM-like proteins or preferably which specifically modulate an activity
of a Dkk/Dkk interacting protein complex or an LRP5 (or LRP6/HBM/HBM-like)/Dkk
complex, or a biologically active fragment of Dkk (e.g., comprising the domain which binds
LRP5/LRP6/HBM/HBM-like) or a Dkk interacting protein complex. Such methods or
assays may utilize any means of monitoring or detecting the desired activity as would be
known in the art (See, e.g., Wu et al., 2000 Curr. Biol. 10: 1611-4; Fedi et al., 1999 J. Biol.
Chem. 274: 19465-72; Grotewold et al., 1999 Mech. Dev. 89: 151-3; Shibata et al., 2000
Mech. Dev. 96: 243-6; Wang et al., 2000 Oncogene 19: 1843-8; and Glinka et al., 1998
Nature 391: 357-62). Potential agents which modulate Dkk include, for example, p53, the
tumor suppressor protein, which can induce Dkk-1. Damage to DNA has also been observed
to up-regulate Dkk-1 expression via a stabilization and activation of p53 (Wang et al., 2000
Oncogene 19: 1843-48; and Shou et al., 2002 Oncogene 21: 878-89). Additionally, Fedi et
al. (1999) purportedly showed that Dkk-1 can block the Wnt2-induced oncogenic
transformation of NIH-3T3 cells. Furthermore, it has been suggested that Dkk expression
may be modulated by BMP signaling in the developing skeleton (Mukhopadhyay et al., 2001
Dev. Cell 1: 423-34; and Grotewold et al., 2002 EMBO J. 21: 966-75). Grotewold et al.
additionally describe altered Dkk expression levels in response to stress signals including UV
irradiation and other genotoxic stimuli. They propose that Dkk expression is pro-apoptotic.
In the HBMMTIC animals described herein, a reduced osteoblast apoptosis effect was
observed. Thus, HBM and HBM-like variants may control/alter Dkk's role in programmed
cell death. Other agents which potentially modulate Dkk activity include the Dkk interacting
proteins identified in Fig. 31.

In one embodiment, the relative amount of Dkk or a Dkk interacting protein in a cell
population that has been exposed to the agent to be tested is compared to that in an unexposed
control cell population. Antibodies can be used to monitor the differential expression of the
protein in the different cell populations. Cell lines or populations are exposed to the agent to
be tested under appropriate conditions and time. Cellular lysates may be prepared from the
exposed cell line or population and a control, unexposed cell line or population. The cellular
lysates are then analyzed with the probe, as would be known in the art. See, e.g., Ed Harlow
and David Lane, Antibodies: A Laboratory Manual (Cold Spring Harbor, NY, 1988) and Ed
Harlow and David Lane, Using Antibodies: A Laboratory Manual (Cold Spring Harbor, NY,
1998).

For example, N- and C-terminal fragments of Dkk can be expressed in bacteria and
used to search for proteins which bind to these fragments. Fusion proteins, such as His-tag or
GST fusion to the N- or C-terminal regions of Dkk (or to biologically active domains of
Dkk-1) or a whole Dkk protein can be prepared. These fusion proteins can be coupled to, for
example, Talon or Glutathione-Sepharose beads and then probed with cell lysates to identify
molecules which bind to Dkk. Prior to lysis, the cells may be treated with purified Wnt
proteins, RNA, or drugs which may modulate Wnt signaling or proteins that interact with
downstream elements of the Wnt pathway. Lysate proteins binding to the fusion proteins can
be resolved by SDS-PAGE, isolated and identified by, for example, protein sequencing or
mass spectroscopy, as is known in the art. See, e.g., Protein Purification Applications: A
Practical Approach (Simon Roe, ed., 2nd ed. Oxford Univ. Press, 2001) and "Guide to Protein
Purification" in Meth. Enzymology vol. 182 (Academic Press, 1997).

The activity of Dkk, a Dkk interacting protein, or a complex of Dkk with
LRP5/LRP6/HBM/HBM-like may be affected by compounds which modulate the interaction
between Dkk and a Dkk interacting protein (such as those shown in Fig. 31) and/or Dkk and
LRP5/LRP6/HBM/HBM-like. The present invention provides methods and research tools
for the discovery and characterization of these compounds. The interaction between Dkk and
a Dkk interacting protein and/or Dkk and LRP5/6/HBM/HBM-like may be monitored in vivo
and in vitro. Compounds which modulate the stability of a Dkk-LRP5/LRP6/HBM/HBM-like
complex are potential therapeutic compounds. Example in vitro methods include:

Binding LRP5/6/HBM/HBM-like, Dkk, or a Dkk interacting protein to a sensor chip
designed for an instrument such as those made by Biacore (Uppsala, Sweden) for the
performance of a plasmon resonance spectroscopy observation. In this method, the chip with
one of Dkk, a Dkk interacting protein, or LRP5/6 is first exposed to the other under
conditions which permit them to form the complex. A test compound is then introduced and
the output signal of the instrument provides an indication of any effect exerted by the test
compound. By this method, compounds may be rapidly screened. Another in vitro method is
exemplified by the SAR-by-NMR methods (Shuker et al., Science 274: 1531-4 (1996)).
Briefly, a Dkk-1 binding domain and/or LRP5 or LRP6 LBD are expressed and purified as
15N-labeled protein by expression in labeled media. The labeled protein(s) are allowed to
form the complex in solution in an NMR sample tube. The heteronuclear correlation
spectrum in the presence and absence of a test compound provides data at the level of
individual residues with regard to interactions with the test compound and changes at the
protein-protein interface of the complex. One of skill in the art knows of many other
protocols, e.g., affinity capillary electrophoresis (Okun et al., 2001 J. Biol. Chem. 276:
1057-62; Vergnon et al., 1999 Methods, 19: 270-7), fluorescence spectroscopy, electron
paramagnetic resonance, etc., which can monitor the modulation of a complex and/or
measure binding affinities for complex formation.

In vivo protocols for monitoring the modulation of a Dkk/LRP5/LRP6/HBM/HBM-like
complex include the yeast two hybrid protocol. The yeast two hybrid method may be
used to monitor the modulation of a complex in vivo by monitoring the expression of genes
activated by the formation of a complex of fusion proteins of Dkk and LRP ligand binding
domains. Nucleic acids according to the invention which encode the interacting Dkk and
LRP LBD domains are incorporated into bait and prey plasmids. The Y2H protocol is
performed in the presence of one or more test compounds. The modulation of the complex is
observed by a change in expression of the complex activated gene. It will be appreciated by
one skilled in the art that test compounds can be added to the assay directly or, in the case of
proteins, can be coexpressed in the yeast with the bait and prey compounds. Similarly,
fusion proteins of Dkk and Dkk interacting proteins can also be used in a Y2H screen to
identify other proteins which modulate the Dkk/Dkk interacting protein complex.

Assay protocols such as these may be used in methods to screen for compounds,
drugs, or treatments which modulate the Dkk/Dkk interacting protein and/or Dkk/LRP5/6
complex, whether such modulation occurs by competitive binding, or by altering the
structure of either LRP5/6 or Dkk at the binding site, or by stabilizing or destabilizing the
protein-protein interface. It may be anticipated that peptide aptamers may competitively
bind, although induction of an altered binding site structure by steric effects is also possible.

28.1 Antibodies and Antibody Fragments 

Polyclonal and monoclonal antibodies and fragments of these antibodies which bind
to Dkk or LRP5/LRP6/HBM/HBM-like can be prepared as would be known in the art. For
example, suitable host animals can be immunized using appropriate immunization protocols
and the peptides, polypeptides or proteins of the invention. Peptides for use in immunization
are typically about 8-40 residues long. If necessary or desired, the polypeptide immunogens
can be conjugated to suitable carriers. Methods for preparing immunogenic conjugates with
carriers such as bovine serum albumin (BSA), keyhole limpet hemocyanin (KLH), or other
carrier proteins are well known in the art (See, Harlow et al., 1988). In some circumstances,
direct conjugation using, for example, carbodiimide reagents, may be effective; in other
instances linking reagents such as those supplied by Pierce Chemical Co., Rockford, IL, may
be desirable to provide accessibility to the polypeptide or hapten. The hapten peptides can be
extended at either the amino or carboxy terminus with a cysteine residue or interspersed with
cysteine residues, for example, to facilitate linking to a carrier. Administration of the
immunogens is conducted generally by injection over a suitable time period and with use of
suitable adjuvants, as is generally understood in the art. During the immunization schedule,
titers of antibodies are taken to determine adequacy of antibody formation.

Anti-peptide antibodies can be generated using synthetic peptides, for example, the 
peptides derived from the sequence of any Dkk, including Dkk-1, or 
LRP5/LRP6/HBM/HBM-like. Synthetic peptides can be as small as 2-3 amino acids in 
length, but are preferably at least 3, 5, 10, or 15 or more amino acid residues long. Such 

peptides can be determined using programs such as DNAStar. The peptides are coupled to
KLH using standard methods and can be used to immunize animals such as rabbits. Polyclonal
anti-Dkk or anti-LRP5/LRP6/HBM/HBM-like peptide antibodies can then be purified, for
example using Actigel beads containing the covalently bound peptide.

While the polyclonal antisera produced in this way may be satisfactory for some
applications, for pharmaceutical compositions, use of monoclonal preparations is preferred.
Immortalized cell lines which secrete the desired monoclonal antibodies may be prepared
using the standard method of Kohler and Milstein or modifications which effect
immortalization of lymphocytes or spleen cells, as is generally known (See, e.g., Harlow et
al., 1988 and 1998). The immortalized cell lines secreting the desired antibodies can be
screened by immunoassay in which the antigen is the peptide hapten, polypeptide or protein.
When the appropriate immortalized cell culture secreting the desired antibody is identified,
the cells can be cultured either in vitro or by production in ascites fluid.

The desired monoclonal antibodies are then recovered from the culture supernatant or
from the ascites supernatant. Fragments of the monoclonal antibodies which contain the
immunologically significant portion can be used as agonists or antagonists of Dkk activity.
Use of immunologically reactive fragments, such as the Fab, scFv, Fab', or F(ab')2 fragments,
is often preferable, especially in a therapeutic context, as these fragments are generally less
immunogenic than the whole immunoglobulin.

The antibodies or fragments may also be produced, using current technology, by
recombinant means. Regions that bind specifically to the desired regions of Dkk or
LRP5/LRP6/HBM/HBM-like can also be produced in the context of chimeras with multiple
species origin. Antibody reagents so created are contemplated for use diagnostically or as
stimulants or inhibitors of Dkk activity.

In one embodiment, antibodies against Dkk bind Dkk with high affinity, i.e., ranging
from 10^-5 to 10^-9 M. Preferably, the anti-Dkk antibody will comprise a chimeric, primate,
Primatized®, human or humanized antibody. Also, the invention embraces the use of
antibody fragments, e.g., Fab, Fv, Fab', and F(ab')2 fragments, and aggregates thereof.

Another embodiment contemplates chimeric antibodies which recognize Dkk or
LRP5/LRP6/HBM/HBM-like. A chimeric antibody is intended to refer to an antibody with
non-human variable regions and human constant regions, most typically rodent variable
regions and human constant regions.

A "primatized® antibody" refers to an antibody with primate variable regions, e.g., 
CDRs, and human constant regions. Preferably, such primate variable regions are derived
from an Old World monkey.

A "humanized antibody" refers to an antibody with substantially human framework 
and constant regions, and non-human complementarity-determining regions (CDRs). 
"Substantially" refers to the fact that humanized antibodies typically retain at least several 
donor framework residues (i.e., of the non-human parent antibody from which the CDRs are
derived).

Methods for producing chimeric, primate, primatized®, humanized and human 
antibodies are well known in the art. See, e.g., U.S. Patent 5,530,101, issued to Queen et al.;
U.S. Patent 5,225,539, issued to Winter et al.; U.S. Patents 4,816,397 and 4,816,567, issued
to Boss et al. and Cabilly et al., respectively, all of which are incorporated by reference in
their entirety.

The selection of human constant regions may be significant to the therapeutic efficacy 
of the subject anti-Dkk or LRP5/LRP6/HBM/HBM-like antibody. In a preferred 
embodiment, the subject anti-Dkk or LRP5/LRP6/HBM/HBM-like antibody will comprise 
human gamma 1 or gamma 3 constant regions and, more preferably, human gamma 1
constant regions. 

Methods for making human antibodies are also known and include, by way of 
example, production in SCID mice, and in vitro immunization.

The subject anti-Dkk or LRP5/LRP6/HBM/HBM-like antibodies can be administered 
by various routes of administration, typically parenteral. This is intended to include
intravenous, intramuscular, subcutaneous, rectal, and vaginal administration, with
intravenous infusion being preferred.

The anti-Dkk or LRP5/LRP6/HBM/HBM-like antibody will be formulated for 
therapeutic usage by standard methods, e.g., by addition of pharmaceutically acceptable
buffers, e.g., sterile saline, sterile buffered water, propylene glycol, and combinations thereof.

Effective dosages will depend on the specific antibody, condition of the patient, age, 
weight, or any other treatments, among other factors. Typically effective dosages will range 
from about 0.001 to about 30 mg/kg body weight, more preferably from about 0.01 to about 25 
mg/kg body weight, and most preferably from about 0.1 to about 20 mg/kg body weight. 
Such administration may be effected by various protocols, e.g., weekly, bi-weekly, or 
monthly, depending on the dosage administered and patient response. Also, it may be 
desirable to combine such administration with other treatments. 

Antibodies to Dkk-1 interacting proteins, such as those identified in Fig. 31, are also 
contemplated according to the present invention, and can be used similarly to the Dkk-1 
antibodies mentioned in the above methodology. 

The antibodies of the present invention can be utilized in experimental screening, as 
diagnostic reagents, and in therapeutic compositions. 

28.2 Chemical Libraries 

Agents that are assayed by these methods can be randomly selected or rationally 
selected or designed. As used herein, an agent is said to be randomly selected when the agent 
is chosen randomly without considering the specific sequences involved in the association of 
Dkk-1 alone, Dkk-1 interacting proteins alone, or with their associated substrates, binding 
partners, etc. An example of randomly selected agents is the use of a chemical library or a 
peptide combinatorial library, or a growth broth of an organism. 

The agents of the present invention can be, as examples, peptides, small molecules, 
vitamin derivatives, as well as carbohydrates. A skilled artisan can readily recognize that 
there is no limit as to the structural nature of the agents of the present invention. 

28.3 Peptide Synthesis 

The peptide agents of the invention can be prepared using standard solid phase (or 
solution phase) peptide synthesis methods, as is known in the art. In addition, the DNA 
encoding these peptides may be synthesized using commercially available oligonucleotide 
synthesis instrumentation and produced recombinantly using standard recombinant 
production systems. The production of polypeptides using solid phase peptide synthesis is 
necessitated if non-nucleic acid-encoded amino acids are to be included. 

29. Uses for Agents that Modulate at Least One Activity of Dkk, a Dkk Interacting 
Protein, a Dkk/Dkk Interacting Protein Complex, or a Dkk/LRP5 or Dkk/LRP6 
Complex 

The proteins and nucleic acids of the invention, such as the proteins or polypeptides 
containing an amino acid sequence of LRP5, Dkk, and Dkk interacting proteins are involved 
in bone mass modulation and lipid modulation or other Wnt pathway mediated activity. 
Agents that modulate (i.e., up and down-regulate) the expression of Dkk or Dkk interacting 
proteins, or agents, such as agonists and antagonists respectively, of at least one activity of 
Dkk or a Dkk interacting protein may be used to modulate biological and pathologic 
processes associated with the function and activity of Dkk or a Dkk interacting protein. 

As used herein, a subject can be preferably any mammal, so long as the mammal is in 
need of modulation of a pathological or biological process modulated by a protein of the 
invention. The term "mammal" means an individual belonging to the class Mammalia. The 
invention is particularly useful in the treatment of human subjects. 

As used herein, a biological or pathological process modulated by Dkk or a Dkk 
interacting protein may include binding of Dkk to a Dkk interacting protein, Dkk to LRP5 or 
LRP6 or release therefrom, inhibiting or activating Dkk or a Dkk interacting protein mRNA 
synthesis or inhibiting Dkk or Dkk interacting protein modulated inhibition of LRP5 or LRP6 
mediated Wnt signaling. Further bone-related markers may be observed such as alkaline 
phosphatase activity, osteocalcin production, or mineralization. 

Pathological processes refer to a category of biological processes which produce a 
deleterious effect. For example, expression or up-regulation of expression of LRP5 or LRP6 
and/or Dkk and/or a Dkk interacting protein may be associated with certain diseases or 
pathological conditions. As used herein, an agent is said to modulate a pathological process 
when the agent statistically significantly (p < 0.05) alters the process from its base level in 
the subject. For example, the agent may reduce the degree or severity of the process 
mediated by that protein in the subject to which the agent was administered. For instance, a 
disease or pathological condition may be prevented, or disease progression modulated by the 
administration of agents which reduce or modulate in some way the expression or at least one 
activity of a protein of the invention. 

As LRP5/6 and Dkk are involved both directly and indirectly in bone mass 
modulation, one embodiment of this invention is to use Dkk or Dkk interacting protein 
expression as a method of diagnosing a bone condition or disease. Certain markers are 
associated with specific Wnt signaling conditions (e.g., TCF/LEF activation). Diagnostic 
tests for bone conditions may include the steps of testing a sample or an extract thereof for 
the presence of Dkk or Dkk interacting protein nucleic acids (i.e., DNA or RNA), oligomers 
or fragments thereof or protein products of TCF/LEF regulated expression. For example, 
standard in situ hybridization or other imaging techniques can be utilized to observe products 
of Wnt signaling. Other diagnostic techniques, as described herein, would also be useful as 
would be apparent to the skilled artisan (e.g., a serum marker). 

This invention also relates to methods of modulating bone development or bone loss 
conditions. Inhibition of bone loss may be achieved by inhibiting or modulating changes in 
the LRP5/6 mediated Wnt signaling pathway. For example, absence of LRP5 activity may 
be associated with low bone mass. Increased LRP5 activity may be associated with high 
bone mass. Therefore, modulation of LRP5 activity will in turn modulate bone development. 
Modulation of the Dkk/LRP5/6 or Dkk/Dkk interacting protein complex via agonists and 
antagonists is one embodiment of a method to regulate bone development. Such modulation 
of bone development can result from inhibition of the activity of, for example, a 
Dkk/LRP(5/6) protein complex, a Dkk/Dkk interacting protein complex, upregulated 
transcription of the LRP5 gene or inhibited transcription or translation of Dkk or Dkk 
interacting protein mRNA. 

The agents of the present invention can be provided alone, or in combination with 
other agents that modulate a particular pathological process. As used herein, two agents are 
said to be administered in combination when the two agents are administered simultaneously 
or are administered independently in a fashion such that the agents will act at the same time. 

The agents of the present invention can be administered via parenteral, subcutaneous 
(sc), intravenous (iv), intramuscular (im), intraperitoneal (ip), transdermal or buccal routes. 
Alternatively, or concurrently, administration may be by the oral route. The dosage 
administered will be dependent upon the age, health, and weight of the recipient, kind of 
concurrent treatment, if any, frequency of treatment, and the nature of the effect desired. 

The present invention further provides compositions containing one or more agents 
which modulate expression or at least one activity of a protein of the invention. While 
individual needs vary, determination of optimal ranges of effective amounts of each 
component is within the skill of the art. Typical dosages of the active agent which mediate 
Dkk or Dkk interacting protein activity comprise from about 0.0001 to about 50 mg/kg body 
weight. The preferred dosages comprise from about 0.001 to about 50 mg/kg body weight. 
The most preferred dosages comprise from about 0.1 to about 1 mg/kg body weight. In an 
average human of 70 kg, the range would be from about 7 µg to about 3.5 g, with a preferred 
range of about 0.5 mg to about 5 mg. 
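
The per-kilogram ranges above convert to absolute amounts by multiplying by body weight. The short sketch below reproduces that arithmetic for the 70 kg example. The function name and the dictionary labels are illustrative only and are not part of the specification; the mg/kg values are those quoted in the preceding paragraph.

```python
# Illustrative arithmetic only; names are hypothetical, values are the
# mg/kg ranges quoted in the text above.
DOSE_RANGES_MG_PER_KG = {
    "typical": (0.0001, 50.0),
    "preferred": (0.001, 50.0),
    "most preferred": (0.1, 1.0),
}


def absolute_dose_mg(range_mg_per_kg, body_weight_kg):
    """Scale a (low, high) mg/kg range to total mg for one subject."""
    low, high = range_mg_per_kg
    return (low * body_weight_kg, high * body_weight_kg)


if __name__ == "__main__":
    for label, dose_range in DOSE_RANGES_MG_PER_KG.items():
        low_mg, high_mg = absolute_dose_mg(dose_range, 70.0)
        print(f"{label}: {low_mg:g} mg to {high_mg:g} mg")
```

For the typical range this gives 0.007 mg (7 µg) to 3500 mg (3.5 g), which matches the overall range stated for a 70 kg human.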

In addition to the pharmacologically active agent, the compositions of the present 
invention may contain suitable pharmaceutically acceptable carriers comprising excipients 
and auxiliaries which facilitate processing of the active compounds into preparations which 
can be used pharmaceutically for delivery to the site of action. Suitable formulations for 
parenteral administration include aqueous solutions of the active compounds in water-soluble 
form, for example, water-soluble salts. In addition, suspensions of the active compounds as 
appropriate oily injection suspensions may be administered. Suitable lipophilic solvents or 
vehicles include fatty oils, for example, sesame oil, or synthetic fatty acid esters (e.g., ethyl 
oleate or triglycerides). Aqueous injection suspensions may contain substances which 
increase the viscosity of the suspension, including, for example, sodium carboxymethyl 
cellulose, sorbitol and/or dextran. Optionally, the suspension may also contain stabilizers. 
Liposomes and other non-viral vectors can also be used to encapsulate the agent for delivery 
into the cell. 

The pharmaceutical formulation for systemic administration according to the 
invention may be formulated for enteral, parenteral, or topical (top) administration. Indeed, 
all three types of formulations may be used simultaneously to achieve systemic 
administration of the active ingredient. 

Suitable formulations for oral administration include hard or soft gelatin capsules, 
pills, tablets, including coated tablets, elixirs, suspensions, syrups or inhalations and 
controlled release forms thereof. 

Potentially, any compound which binds Dkk or a Dkk interacting protein or 
modulates the Dkk/LRP5 or Dkk/LRP6 or Dkk/Dkk interacting protein complex may be a 
therapeutic compound. In one embodiment of the invention, a peptide or nucleic acid 
aptamer according to the invention is used in a therapeutic composition. Such compositions 
may comprise an aptamer, or a LRP5 or LRP6 fragment unmodified or modified. In another 
embodiment, the therapeutic compound comprises a Dkk-1 interacting protein, or 
biologically active fragment thereof. 

Nucleic acid aptamers have been used in compositions for example by chemical 
bonding to a carrier molecule such as polyethylene glycol (PEG) which may facilitate uptake 
or stabilize the aptamer. A di-alkylglycerol moiety attached to an RNA will embed the 
aptamer in liposomes, thus stabilizing the compound. Incorporating chemical substitutions 
(i.e., changing the 2'-OH group of ribose to a 2'-NH2 in RNA confers ribonuclease resistance) 
and capping, etc. can prevent breakdown. Several such techniques are discussed for RNA 
aptamers in Brody and Gold (Rev. Mol. Biol. 74:3-13, 2000). 

Peptide aptamers may be used in therapeutic applications by the introduction of an 
expression vector directing aptamer expression into the affected tissue, such as, for example, by 
retroviral delivery, by encapsulating the DNA in a delivery complex or simply by naked 
DNA injection. Or, the aptamer itself or a synthetic analog may be used directly as a drug. 
Encapsulation in polymers and lipids may assist in delivery. The use of peptide aptamers as 
therapeutic and diagnostic agents is reviewed by Hoppe-Seyler and Butz (J. Mol. Med. 
78:426-430 (2000)). 

In another aspect of the invention, the structure of a constrained peptide aptamer of 
the invention may be determined such as by NMR or X-ray crystallography. (Cavanagh et 
al., Protein NMR Spectroscopy: Principles and Practice, Academic Press, 1996; Drenth, 
Principles of Protein X-Ray Crystallography, Springer Verlag, 1999) Preferably the structure 
is determined in complex with the target protein. A small molecule analog is then designed 
according to the positions of functional elements of the 3D structure of the aptamer. 
(Guidebook on Molecular Modeling in Drug Design, Cohen, Ed., Academic Press, 1996; 
Molecular Modeling and Drug Design (Topics in Molecular and Structural Biology), Vinter 
and Gardner Eds., CRC Press, 1994) Thus the present invention provides a method for the 
design of effective and specific drugs which modulate the activity of Dkk, Dkk interacting 
proteins, Dkk/Dkk interacting protein complex and the Dkk/LRP complex. Small molecule 
mimetics of the peptide aptamers of the present invention are encompassed within the scope 
of the invention. 

In practicing the methods of this invention, the compounds of this invention may be 
used alone or in combination, or in combination with other therapeutic or diagnostic agents. 
In certain preferred embodiments, the compounds of this invention may be co-administered 
along with other compounds typically prescribed for these conditions according to generally 
accepted medical practice. For example, the compounds of this invention can be 
administered in combination with other therapeutic agents for the treatment of bone loss. 
Bone loss mediating agents include bone resorption inhibitors such as bisphosphonates (e.g., 
alendronic acid, clodronic acid, etidronic acid, pamidronic acid, risedronic acid and tiludronic 
acid), vitamin D and vitamin D analogs, cathepsin K inhibitors, hormonal agents (e.g., 
calcitonin and estrogen), and selective estrogen receptor modulators or SERMs (e.g., 
raloxifene), as well as bone forming agents such as parathyroid hormone (PTH) and bone 
morphogenetic proteins (BMP). 

Additionally contemplated are combinations of agents which regulate Dkk-1 and 
agents which regulate lipid levels such as HMG-CoA reductase inhibitors (i.e., statins such as 
Mevacor®, Lipitor® and other inhibitors such as Baycol®, Lescol®, Pravachol® and 
Zocor®), bile acid sequestrants (e.g., Colestid® and Welchol®), fibric acid derivatives 
(Atromid-S®, Lopid®, Tricor®), and nicotinic acid. 

The compounds of this invention can be utilized in vivo, ordinarily in vertebrates and 
preferably in mammals, such as humans, sheep, horses, cattle, pigs, dogs, cats, rats and mice, 
or in vitro. 

30. Peptide and Nucleotide Aptamers and Peptide Aptamer Mimetics 

Another embodiment contemplates the use of peptide and nucleotide aptamer 
technology to screen for agents which interact with Dkk, which block Dkk from interacting 
with LRP5, LRP6, HBM, or HBM-like molecules, or which block any other Dkk ligand 
interaction, or which interact with Dkk interacting proteins, such as those shown in Figure 5. 
Peptide aptamers are molecules in which a variable peptide domain is displayed from a 
scaffold protein. Thioredoxin A (trxA) is commonly used for a scaffold. The peptide insert 
destroys the catalytic site of trxA. It is recognized that numerous proteins may also be used 
as scaffolding proteins to constrain and/or present a peptide aptamer. Other scaffold proteins 
that could display a constrained peptide aptamer could include staphylococcal nuclease, the 
protease inhibitor eglin C, the Streptomyces tendae alpha-amylase inhibitor Tendamistat, 
Sp1, green fluorescent protein (GFP) (reviewed in Hoppe-Seyler et al., 2001 J. Steroid 
Biochem. Mol. Biol. 78: 105-11), and the S1 nuclease from Staphylococcus or M13 for phage 
display. Any molecule to which the aptamer could be anchored and presented in its 
bioactive conformation would be suitable. 

Aptamers can then specifically bind to a given target protein in vitro and in vivo and 
have the potential to selectively block the function of their target protein. Peptide aptamers 
are selected from randomized expression libraries on the basis of their in vivo binding 
capacity to the desired target protein. Briefly, a target protein (e.g., Dkk, a Dkk interacting 
protein, or LRP5/6) is linked to a heterologous DNA binding domain (BD) and expressed as 
bait in a yeast test strain. Concomitantly, a library coding for different peptides (e.g., 
16-mers) of randomized sequence inserted in a scaffold protein sequence, which are linked to a 
heterologous transcriptional activation domain (AD), is expressed as prey. If a peptide binds 
to a target protein, a functional transcription factor is reconstituted, in which the BD and AD 
are bridged together by interacting proteins. This transcription factor is then able to activate 
the promoter of a marker gene which can be monitored by colorimetric enzymatic assays or 
by growth selection. Additional variations, methods of preparation, and screening 
methodologies are described in, for example, Hoppe-Seyler et al., 2000 J. Mol. Med. 78: 
426-430. 

Nucleotide aptamers are described, for example, in Brody et al., 2000 Trends Mol. 
Biotechnol. 74: 5-13. Additional methods of making and using nucleotide aptamers include 
SELEX, i.e., Systematic Evolution of Ligands by Exponential Enrichment. SELEX is a 
process of isolating oligonucleotide ligands of a chosen target molecule (see Tuerk and Gold, 
Science 249:505-510 (1990); U.S. Pat. Nos. 5,475,096, 5,595,877, and 5,660,985). SELEX, 
as described in Tuerk and Gold, involves admixing the target molecule with a pool of 
oligonucleotides (e.g., RNA) of diverse sequences; retaining complexes formed between the 
target and oligonucleotides; recovering the oligonucleotides bound to the target; 
reverse-transcribing the RNA into DNA; amplifying the DNA with polymerase chain reactions 
(PCR); transcribing the amplified DNA into RNA; and repeating the cycle with ever 
increasing binding stringency. Three enzymatic reactions are required for each cycle. It 
usually takes 12-15 cycles to isolate aptamers of high affinity and specificity to the target. 
An aptamer is an oligonucleotide that is capable of binding to an intended target substance 
but not other molecules under the same conditions. 
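
The iterative character of SELEX can be illustrated with a toy enrichment model. The sketch below is not the method of Tuerk and Gold; it simply assumes that each round multiplies the ratio of binders to non-binders by a constant factor, and that amplification restores pool size without changing composition. The starting fraction, the per-round enrichment factor, and the function names are hypothetical and chosen only for illustration.

```python
def binder_fraction_after(rounds, start_fraction, enrichment_per_round):
    """Fraction of binders in the pool after a number of selection rounds.

    Assumes (hypothetically) that each round multiplies the
    binder:non-binder ratio by a constant enrichment factor.
    """
    odds = start_fraction / (1.0 - start_fraction)
    odds *= enrichment_per_round ** rounds
    return odds / (1.0 + odds)


def rounds_to_reach(target_fraction, start_fraction, enrichment_per_round):
    """Smallest number of rounds at which binders reach the target fraction."""
    rounds = 0
    while binder_fraction_after(rounds, start_fraction,
                                enrichment_per_round) < target_fraction:
        rounds += 1
    return rounds


if __name__ == "__main__":
    # Hypothetical inputs: 1 binder per 10^12 sequences, 10-fold enrichment
    # per round, target of 90% binders in the final pool.
    print(rounds_to_reach(0.9, 1e-12, 10.0))
```

Because the model is multiplicative, the number of rounds needed grows only with the logarithm of the starting rarity, which is why a modest number of cycles can recover very rare sequences.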

In another reference, Bock et al., 1990 Nature 355: 564-6, describe a different process 
from the SELEX method of Tuerk and Gold in that only one enzymatic reaction is required 
for each cycle (i.e., PCR) because the nucleic acid library in Bock's method is comprised of 
DNA instead of RNA. The identification and isolation of aptamers of high specificity and 
affinity with the method of Bock et al. still requires repeated cycles in a chromatographic 
column. 

Other nucleotide aptamer methods include those described by Conrad et al., 1996 
Meth. Enzymol. 267: 336-367. Conrad et al. describe a variety of methods for isolating 
aptamers, all of which employ repeated cycles to enrich target-bound ligands and require a 
large amount of purified target molecules. More recently described methods of making and 
using nucleotide aptamers include, but are not limited to, those described in U.S. Patent Nos. 
6,180,348; 6,051,388; 5,840,867; 5,780,610; 5,756,291 and 5,582,981. 

Potentially, any compound which binds Dkk or a Dkk interacting protein or 
modulates the Dkk/Dkk interacting protein or Dkk/LRP5, Dkk/LRP6, Dkk/HBM, or 
Dkk/HBM-like complex may be a therapeutic compound. In one embodiment of the 
invention, a peptide or nucleic acid aptamer according to the invention is used in a 
therapeutic composition. Such compositions may comprise an aptamer, or a LRP5 or LRP6 
fragment unmodified or modified. 

Nucleic acid aptamers have been used in compositions for example by chemical 
bonding to a carrier molecule such as polyethylene glycol (PEG) which may facilitate uptake 
or stabilize the aptamer. A di-alkylglycerol moiety attached to an RNA will embed the 
aptamer in liposomes, thus stabilizing the compound. Incorporating chemical substitutions 
(i.e., changing the 2'-OH group of ribose to a 2'-NH2 in RNA confers ribonuclease resistance) 
and capping, etc. can prevent breakdown. Several such techniques are discussed for RNA 
aptamers in Brody et al., 2000 Rev. Mol. Biol. 74: 3-13. 

Peptide aptamers may be used in therapeutic applications by the introduction of an 
expression vector directing aptamer expression into the affected tissue, such as, for example, by 
retroviral delivery, by encapsulating the DNA in a delivery complex or simply by naked 
DNA injection. Or, the aptamer itself or a synthetic analog may be used directly as a drug. 
Encapsulation in polymers and lipids may assist in delivery. The use of peptide aptamers as 
therapeutic and diagnostic agents is reviewed by Hoppe-Seyler et al., 2000 J. Mol. Med. 78: 
426-430. 

In another aspect of the invention, the structure of a constrained peptide aptamer of 
the invention may be determined such as by NMR or X-ray crystallography. (Cavanagh et al., 
Protein NMR Spectroscopy: Principles and Practice, Academic Press, 1996; Drenth, 
Principles of Protein X-Ray Crystallography, Springer Verlag, 1999) Preferably the structure 
is determined in complex with the target protein. A small molecule analog is then designed 
according to the positions of functional elements of the 3D structure of the aptamer. 
(Guidebook on Molecular Modeling in Drug Design, Cohen, Ed., Academic Press, 1996; 
Molecular Modeling and Drug Design (Topics in Molecular and Structural Biology), Vinter 
and Gardner Eds., CRC Press, 1994) Thus, a method is provided for the design of effective 
and specific drugs which modulate the activity of Dkk, Dkk interacting proteins, Dkk/Dkk 
interacting protein complex, and the Dkk/LRP complex. Small molecule mimics of the 
peptide aptamers of the present invention are also encompassed within the scope of the 
invention. 

EXAMPLES 

The present invention is described by reference to the following Examples, which are 
offered by way of illustration and are not intended to limit the invention in any manner. 
Standard techniques well known in the art or the techniques specifically described below 
were utilized. 

Example 1 

The propositus was referred by her physicians to the Creighton Osteoporosis Center 
for evaluation of what appeared to be unusually dense bones. She was 18 years old and had come 
to medical attention two years previously because of back pain, which was precipitated by an 
auto accident in which the car in which she was riding as a passenger was struck from 
behind. Her only injury was soft tissue injury to her lower back that was manifested by pain 
and muscle tenderness. There was no evidence of fracture or subluxation on radiographs. 
The pain lasted for two years, although she was able to attend school full time. By the time 
she was seen in the Center, the pain was nearly resolved and she was back to her usual 
activities as a high school student. Physical exam revealed a normal healthy young woman 
standing 66 inches and weighing 128 pounds. Radiographs of the entire skeleton revealed 
dense looking bones with thick cortices. All bones of the skeleton were involved. Most 
importantly, the shapes of all the bones were entirely normal. The spinal BMC was 94.48 
grams in L1-4, and the spinal BMD was 1.667 gm/cm² in L1-4. BMD was 5.62 standard 
deviations (SD) above peak skeletal mass for women. These were measured by DXA using a 
Hologic 2000. Her mother was then scanned and a lumbar spinal BMC of 58.05 grams and 
BMD of 1.500 gm/cm² were found. Her mother's values place her 4.12 SD above peak mass 
and 4.98 SD above her peers. Her mother was 51 years old, stood 65 inches and weighed 
140 pounds. Her mother was in excellent health with no history of musculoskeletal or other 
symptoms. Her father's lumbar BMC was 75.33 grams and his BMD was 1.118 gm/cm². 
These values place him 0.25 SD above peak bone mass for males. He was in good health, 
stood 72 inches tall, and weighed 187 pounds. 
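
The SD scores quoted above follow the usual form: measured BMD minus a reference mean, divided by a reference SD. The reference values themselves are not stated in this Example, so the sketch below back-calculates the values implied by the two female measurements (propositus and mother, both scored against peak mass for women). The implied reference mean and SD are an inference from the quoted figures, not values given in the text, and the function names are illustrative.

```python
def implied_reference(bmd_a, score_a, bmd_b, score_b):
    """Solve score = (bmd - mean) / sd for (mean, sd) from two data points."""
    sd = (bmd_a - bmd_b) / (score_a - score_b)
    mean = bmd_a - score_a * sd
    return mean, sd


def sd_score(bmd, mean, sd):
    """Number of reference SDs by which a BMD lies above the reference mean."""
    return (bmd - mean) / sd


if __name__ == "__main__":
    # Quoted values: propositus 1.667 gm/cm^2 at +5.62 SD,
    # mother 1.500 gm/cm^2 at +4.12 SD (both versus peak mass for women).
    mean, sd = implied_reference(1.667, 5.62, 1.500, 4.12)
    print(f"implied reference mean = {mean:.3f} gm/cm^2")
    print(f"implied reference SD   = {sd:.3f} gm/cm^2")
    print(f"propositus check: {sd_score(1.667, mean, sd):.2f} SD")
```

Two data points determine the two unknowns exactly, so this is a consistency check on the quoted numbers rather than an independent estimate of the reference population.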

These clinical data suggested that the propositus inherited a trait from her mother, 
which resulted in very high bone mass, but an otherwise normal skeleton, and attention was 
focused on the maternal kindred. In U.S. Patent No. 5,691,153, twenty-two of these 
members had measurement of bone mass by DXA. In one case, the maternal grandfather of 
the propositus was deceased; however, medical records, antemortem skeletal radiographs 
and a gall bladder specimen embedded in paraffin for DNA genotyping were obtained. His 
radiographs showed obvious extreme density of all of the bones available for examination 
including the femur and the spine, and he was included among the affected members. In this 
invention, the pedigree has been expanded to include 37 informative individuals. These 
additions are a significant improvement over the original kinship (Johnson et al., Am. J. Hum. 
Genet., 60:1326-1332 (1997)) because, among the fourteen individuals added since the 
original study, two individuals harbor key crossovers. X-linkage is ruled out by the presence 
of male-to-male transmission from individual 12 to 14 and 15. 
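
The exclusion of X-linkage rests on a simple rule: a father passes his Y chromosome, not his X, to his sons, so an affected father with an affected son argues against X-linked inheritance. A minimal sketch of that check follows. Only the father-to-son links for individuals 12, 14 and 15 come from the text; the record layout and the function name are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Person:
    ident: int
    sex: str                      # "M" or "F"
    affected: bool
    father: Optional[int] = None  # id of the father, if recorded


def male_to_male_transmissions(pedigree):
    """Return (father_id, son_id) pairs in which both are affected males."""
    by_id = {p.ident: p for p in pedigree}
    pairs = []
    for person in pedigree:
        if person.father is None:
            continue
        father = by_id.get(person.father)
        if (father is not None and father.affected
                and person.sex == "M" and person.affected):
            pairs.append((father.ident, person.ident))
    return pairs


if __name__ == "__main__":
    # Individuals 12, 14 and 15 as described in the text.
    pedigree = [
        Person(12, "M", True),
        Person(14, "M", True, father=12),
        Person(15, "M", True, father=12),
    ]
    pairs = male_to_male_transmissions(pedigree)
    print("X-linkage excluded" if pairs else "X-linkage not excluded", pairs)
```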

Example 2 

The present invention describes DNA sequences derived from two BAC clones from 
the HBM gene region, as evident in Table 7 below, which is an assembly of these clones. 
Clone b200e21-h (ATCC No. 980812; SEQ ID NOS: 10-11) was deposited at the American 
Type Culture Collection (ATCC), 10801 University Blvd., Manassas, VA 20110-2209 
U.S.A., on December 30, 1997. Clone b527d12-h (ATCC No. 980720; SEQ ID NOS: 5-9) 
was deposited at the American Type Culture Collection (ATCC), 10801 University Blvd., 
Manassas, VA 20110-2209 U.S.A., on October 2, 1998. These sequences are unique 
reagents that can be used by one skilled in the art to identify DNA probes for the Zmax1 
gene, PCR primers to amplify the gene, nucleotide polymorphisms in the Zmax1 gene, or 
regulatory elements of the Zmax1 gene. 

Example 3 

Yeast-2 Hybrid screen for peptide aptamer sequences to Dkk-1 

Peptide aptamer library construction. A peptide aptamer library, Tpep, was 
constructed, which provides a means to identify chimeric proteins that bind to a protein target 
(or bait) of interest using classic yeast two hybrid (Y2H) assays. The Tpep library is a 
combinatorial aptamer library composed of constrained random peptides, expressed within 
the context of the disulfide loop of E. coli thioredoxin (trxA), and as C-terminal fusions to the 
S. cerevisiae Gal4 activation domain. The Tpep library was generated using a restriction 
enzyme modified recombinant Y2H prey vector, pPC86 (Gibco), which contains the trxA 
scaffold protein. 

Generation of aptamer-encoding sequences. Aptamer-encoding sequences were 
produced as follows. DNA encoding random stretches of approximately sixteen amino acids 
surrounded by appropriate restriction sites was generated by semi-random oligonucleotide 
synthesis. The synthetic oligonucleotides were PCR-amplified, restriction digested, and 
cloned into the permissive sites within the trxA scaffold protein. The cloning strategy was to 
insert the random oligonucleotide sequence in-frame with the scaffold protein coding 
sequence, resulting in expression of a scaffold protein-aptamer chimera. The scaffold protein 
is itself in-frame with the activation domain of Gal4, within the pPC86 vector that is 
appropriate for the aptamer to be expressed and functional in a regular Y2H assay. 
Additional methods of preparing aptamers would be apparent to the skilled artisan. 

Generation of a permissive recombinant pPC86 vector containing the trxA coding 
sequence. First, the RsrII restriction site located within the Gal4 activation domain of pPC86 
(Gibco) was eliminated by site-directed mutagenesis (Quickchange™ kit, Stratagene). The 
amino acid sequence of the Gal4 activation domain was unchanged by this modification. The 
strength of different control interactions was verified to be unchanged by the modification. 

Second, the E. coli trxA coding sequence was cloned into the SalI and NotI sites of 
the RsrII-modified pPC86. EcoRI and SpeI sites were then introduced within the trxA RsrII 
site. The oligonucleotides encoding the peptide aptamers were cloned into the EcoRI and 
SpeI sites of the resulting vector. 
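
As an illustration of how such an insert might be laid out, the sketch below builds one semi-random oligonucleotide encoding sixteen amino acids between an EcoRI and a SpeI site and translates the variable region. The text does not specify the randomization scheme or the flanking bases, so the NNK codon scheme (N = any base, K = G or T), the exact flank layout, and all names used here are assumptions.

```python
import random

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

ECORI_SITE = "GAATTC"  # 5' cloning site named in the text
SPEI_SITE = "ACTAGT"   # 3' cloning site named in the text


def random_nnk_region(n_codons, rng):
    """Concatenate n_codons NNK codons (assumed randomization scheme)."""
    return "".join(
        rng.choice("ACGT") + rng.choice("ACGT") + rng.choice("GT")
        for _ in range(n_codons)
    )


def translate(dna):
    """Translate complete codons of a DNA string; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def build_insert(variable_region):
    """Flank the variable region with the two cloning sites (hypothetical layout)."""
    return ECORI_SITE + variable_region + SPEI_SITE


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the sketch is repeatable
    region = random_nnk_region(16, rng)
    print(build_insert(region))
    print(translate(region))
```

Under NNK the only stop codon that can arise is TAG, so some library members would still be truncated. A real design would also need to screen the variable region for internal EcoRI or SpeI sites.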



TABLE 13 

Contig                  ATCC No.   SEQ ID NO.   Length (base pairs) 
b527d12-h contig302G    980720     5            3096 
b527d12-h contig306G    980720     6            26928 
b527d12-h contig307G    980720     7            29430 
b527d12-h contig308G    980720     8            33769 
b527d12-h contig309G    980720     9            72049 
b200e21-h contig1       980812     10           8705 
b200e21-h contig4       980812     11           66933 

Example 4 
Generation of antibodies 

In each of the following antibody-generating examples, the synthesis of these linear 
peptides is followed by injection into two New Zealand rabbits. Subsequent boosts and 
bleeds are taken according to a standard ten-week protocol. The end-user receives back 5 
mg of peptide, aliquots of pre-bleeds, roughly 80 ml of crude sera from each of the two 
rabbits, and ELISA titration data. 

Generation of LRP5 Polymorphism-specific antibodies. Antibodies were generated 
to the following peptides to obtain antibodies which distinguish the HBM polymorphism 
versus wild-type LRP5/Zmax: MYWTDWVETPRIE (SEQ ID NO: ) (mutant peptide) and 
MYWTDWGETPRIE (SEQ ID NO: ) (wild-type peptide for negative selection). 
Immunofluorescence data confirmed that the antibody, after affinity purification, is specific 
for HBM and does not recognize LRP5 (Fig. 32). 

Generation of LRP5 Monospecific antibodies. LRP5 monospecific polyclonal 
antibodies were generated to the following amino acid sequences of LRP5: Peptide 1 (a.a. 
265-277) - KRTGGKRKEELSA (SEQ ID NO: ), Peptide 2 (a.a. 1178-1194) - 
ERVEKTTGDKRTRIQGR (SEQ ID NO: ), and Peptide 3 (a.a. 1352-1375) - 
KQQCDSFPDCIDGSDE (SEQ ID NO: ). Immunofluorescence confirmed that the antibody 
generated detects LRP5. 

Generation of Dkk-1 monospecific polyclonal antibodies. Dkk-1 monospecific 
polyclonal antibodies were generated to the following amino acid sequences of Dkk-1: 
Peptide 1 - GNKYQTIDNYQPYPC (SEQ ID NO: ), 
Peptide 2 - LDGYSRRTTLSSKMYHTKGQEG (SEQ ID NO: ), 
Peptide 3 - RIQKDHHQASNSSRLHTCQRH (SEQ ID NO: ), 
Peptide 4 - RGEIEETITESFGND (SEQ ID NO: ), and 
Peptide 5 - EIFQRCGEGLSCRIQKD (SEQ ID NO: ) of human Dkk-1. Western blots 
demonstrated that the antibodies generated against peptides 2 and 4 are specific toward 
Dkk-1. 

Example 5 

Effects of exogenous Dkk-1 on Wnt-mediated signaling in the Xenopus embryo assay 

Xenopus embryos are an informative and well-established in vivo assay system to 
evaluate the modulation of Wnt signaling (McMahon et al., 1989 Cell 58: 1075-84; Smith et 
al., 1991; reviewed in Wodarz and Nusse, 1998). 

Modification of the Wnt signaling pathway can be visualized by examining the 
embryos for a dorsalization phenotype (duplicated body axis) after RNA injection into the 
ventral blastomere at the 4- or 8-cell stage. On the molecular level, phenotypes can be 
analyzed by looking for expression of various marker genes in stage 10.5 embryos. Such 
markers would include general endoderm, mesoderm, and ectoderm markers as well as a 
variety of tissue-specific transcripts. 

Analysis can be done by RT-PCR/TaqMan® and can be done on whole embryo tissue 
or in a more restricted fashion (microdissection). Because this system is very flexible and 
rapid, by injecting combinations of transcripts, such as HBM and different Wnts or Wnt 
antagonists, the mechanism of HBM in the Wnt pathway can thereby be dissected. 
Furthermore, investigations are conducted to determine whether Zmax/LRP5 and HBM 
differentially modulate Wnt signaling either alone, or in combination with other components. 
Previous studies have demonstrated that LRP6 alone or LRP5 + Wnt5a were able to induce 
axis duplication (dorsalization) in this system (Tamai et al., 2000 Nature 407: 530-35). 

Constructs for Xenopus Expression (Vector pCS2+). Constructs were prepared using 
the vector pCS2+. DNA inserts were subcloned in the sense orientation with respect to the 
vector SP6 promoter. The pCS2+ vector contains an SV40 virus polyadenylation signal and 
T3 promoter sequence (for generation of antisense mRNA) downstream of the insert. 

Full length Zmax/LRP5 and HBM ORF cDNA: Insert cDNA was isolated from the 
full length cDNA retrovirus constructs (with optimized Kozak sequences) by BglII-EcoRI 
digestion and subcloned into the BamHI-EcoRI sites of the pCS2+ vector. cDNAs encoding an 
HBM-like molecule could be subcloned into pCS2+ vectors and processed similarly by one of 
ordinary skill. 

Full length XWnt8: This cDNA was PCR amplified from a Xenopus embryo cDNA 
library using oligos 114484 (SEQ ID NO: ) (5'-CAGTGAATTCACCATGCAAAACACCACTTTGTTC-3') 
and 114487 (SEQ ID NO: ) (5'-CAGTTGCGGCCGCTCATCTCCGGTGGCCTCTG-3'). The oligos were 
designed to amplify the ORF with a consensus Kozak sequence at the 5' end as determined 
from GenBank #X57234. PCR was carried out using the following conditions: 96°C, 45 sec; 
63°C, 45 sec; 72°C, 2 min. for 30 cycles. The resulting PCR product was purified, 
subcloned into pCRII-TOPO (Invitrogen Corp.), sequence verified, and digested with 
BamHI/XhoI. This insert was subcloned into the vector at the BamHI-XhoI sites. 

Full length Wnt5a: A murine Wnt5a cDNA clone was purchased from Upstate 
Biotechnology (Lake Placid, NY) and subcloned into the EcoRI site of the vector. 
Sequencing confirmed insert orientation. 

Full length human Dkk-1: A human cDNA with GenBank accession number 
AF127563 was available in the public database. Using this sequence, PCR primers were 
designed to amplify the open reading frame with a consensus Kozak sequence immediately 
upstream of the initiating ATG. Oligos 117162 (SEQ ID NO: ) (5'- 
CAATAGTCGACGAATTCACCATGGCTCTGGGCGCAGCGG-3') and 117163 (SEQ ID 
NO: ) (5'-GTATTGCGGCCGCTCTAGATTAGTGTCTCTGACAAGTGTGAA-3') were 
used to screen a human uterus cDNA library by PCR. The resulting PCR product was 
purified, subcloned into pCRII-TOPO (Invitrogen Corp.), sequence verified, and digested 
with EcoRI/XhoI. This insert was subcloned into the pCS2+ vector at the EcoRI-XhoI sites. 

Full length human Dkk-2: A full length cDNA encoding human Dkk-2 was isolated 
to investigate the specificity of the Zmax/LRP5/HBM interaction with the Dkk family of 
molecules. Dkk-1 was identified in yeast as a potential binding partner of 
Zmax/LRP5/HBM. Dkk-1 has also been shown in the literature to be an antagonist of the 
Wnt signaling pathway, while Dkk-2 is not (Krupnik et al., 1999). The Dkk-2 full length 
cDNA serves as a tool to discriminate the specificity and biological significance of 
Zmax/LRP5/HBM interactions with the Dkk family (e.g., Dkk-1, Dkk-2, Dkk-3, Dkk-4, 
Soggy, their homologs and variants, etc.). A human cDNA sequence for Dkk-2 (GenBank 
Accession No. NM_014421) was available in the public database. Using this sequence, PCR 
primers were designed to amplify the open reading frame with a consensus Kozak sequence 
immediately upstream of the initiating ATG. Oligos 51409 (SEQ ID NO: ) (5'- 
CTAACGGATCCACCATGGCCGCGTTGATGCGG-3') and 51411 (SEQ ID NO: ) (5'- 
GATTCGAATTCTCAAATTTTCTGACACACATGG-3') were used to screen human 
embryo and brain cDNA libraries by PCR. The resulting PCR product was purified, 
subcloned into pCRII-TOPO, sequence verified, and digested with BamHI/EcoRI. This insert 
was subcloned into the pCS2+ vector at the BamHI-EcoRI sites. 
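
The primer designs described above can be checked computationally. The following is a minimal sketch, offered for illustration only and not part of the described protocol: it scans the two Dkk-2 oligos quoted above for recognition sites of several common restriction enzymes and for the motif ACCATG, which is used here as a simple stand-in for a consensus Kozak sequence immediately upstream of the initiating ATG. The function names `find_sites` and `has_kozak_atg`, the enzyme panel, and the choice of motif are assumptions made for this example.

```python
# Recognition sequences of common restriction enzymes. All are palindromic,
# so scanning a single strand is sufficient.
SITES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "XhoI": "CTCGAG",
    "XbaI": "TCTAGA",
    "SalI": "GTCGAC",
    "NotI": "GCGGCCGC",
}


def find_sites(oligo):
    """Return the sorted names of enzymes whose site occurs in the oligo."""
    seq = oligo.upper()
    return sorted(name for name, site in SITES.items() if site in seq)


def has_kozak_atg(oligo):
    """True if the oligo contains ACCATG (assumed Kozak-like motif plus ATG)."""
    return "ACCATG" in oligo.upper()


# Dkk-2 oligos as quoted in the text above.
DKK2_FWD = "CTAACGGATCCACCATGGCCGCGTTGATGCGG"   # oligo 51409
DKK2_REV = "GATTCGAATTCTCAAATTTTCTGACACACATGG"  # oligo 51411

print(find_sites(DKK2_FWD), has_kozak_atg(DKK2_FWD))
print(find_sites(DKK2_REV))
```

For these two oligos the scan reports a BamHI site in the forward primer and an EcoRI site in the reverse primer, which matches the enzyme pair used for the digestion and subcloning of the Dkk-2 insert.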

Full length LRP6 was isolated from the pED6dpc4 vector by XhoI-XbaI digestion. 
The full length cDNA was reassembled into the XhoI-XbaI sites of pCS2+. Insert orientation 
was confirmed by DNA sequencing. 

mRNA Synthesis and Microinjection Protocol. mRNA for microinjection into 
Xenopus embryos is generated by in vitro transcription using the cDNA constructs in the 
pCS2+ vector described above as template. RNA is synthesized using the Ambion mMessage 
mMachine high yield capped RNA transcription kit (Cat. #1340) following the 
manufacturer's specifications for the Sp6 polymerase reactions. RNA products were brought 
up to a final volume of 50 µl in sterile, glass-distilled water and purified over Quick Spin 
Columns for Radiolabeled RNA Purification G50-Sephadex (Roche, Cat. #1274015) 
following the manufacturer's specifications. The resulting eluate was finally extracted with 
phenol:chloroform:isoamyl alcohol and isopropanol precipitated using standard protocols 
(Sambrook et al., 1989). Final RNA volumes were approximately 50 µl. RNA concentration 
was determined by absorbance values at 260 nm and 280 nm. RNA integrity was visualized 
by ethidium bromide staining of denaturing (formaldehyde) agarose gel electrophoresis 
(Sambrook et al., 1989). Various amounts of RNA (2 pg to 1 ng) are injected into the ventral 
blastomere of the 4- or 8-cell Xenopus embryo. These protocols are described in Moon et al., 
1989 Technique-J. of Methods in Cell & Mol. Biol. 1: 76-89; and Peng, 1991 Meth. Cell 
Biol. 36: 657-62. 
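
The absorbance-based quantitation mentioned above can be sketched as follows. This is an illustrative calculation only: the conversion factor of about 40 µg/ml of single-stranded RNA per A260 unit is the commonly used convention and is an assumption here, as it is not stated in the text, and the function names are hypothetical.

```python
# Assumed conventional conversion: 1 A260 unit ~ 40 ug/ml single-stranded RNA.
RNA_UG_PER_ML_PER_A260 = 40.0


def rna_concentration_ug_per_ml(a260, dilution_factor=1.0):
    """Estimate RNA concentration (ug/ml) from A260 and the sample dilution."""
    return a260 * RNA_UG_PER_ML_PER_A260 * dilution_factor


def purity_ratio(a260, a280):
    """A260/A280 ratio, commonly used as a rough indicator of RNA purity."""
    return a260 / a280


# Example with made-up readings for a sample diluted 1:10 before measurement.
print(rna_concentration_ug_per_ml(0.5, dilution_factor=10))
print(purity_ratio(0.5, 0.25))
```

The resulting concentration can then be used to dilute the transcript to the picogram amounts injected per embryo.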

Screening for Duplicated Body Axis. In vitro transcribed RNA is purified and 
injected into a ventral blastomere of the 4- or 8-cell Xenopus embryo (approx. 2 hours 
post-fertilization). At stage 10.5 (approx. 11 hours post-fertilization), the injected embryos are 
cultured for a total of 72 hours and then screened for the presence of a duplicated body axis 
(dorsalization) (Fig. 33). Using XWnt8-injected embryos (2-10 pg) as a positive control 
(Christian et al., 1991) and water-injected or non-injected embryos as negative controls, we 
replicated the published observation that Zmax(LRP5) + Wnt5a (500 and 20 pg, respectively) 
could induce axis duplication. Wnt5a (20 pg) alone could not induce axis duplication (as 
previously reported by Moon et al., 1993). We have also injected GFP RNA (100-770 pg) as a 
negative control to show that the amount of RNA injected is not perturbing embryo 
development (not shown). Strikingly, HBM + Wnt5a (500 and 20 pg, respectively) yielded an 
approximately 3.5 fold more robust response of the phenotype (p=0.043 by Fisher's exact test) 
compared to Zmax(LRP5) + Wnt5a, suggesting that the HBM mutation is activating the Wnt 
pathway (Figs. 34 and 35). The HBM/Wnt5a embryos also appear to be more "anteriorized" 
than the Zmax(LRP5)/Wnt5a embryos, again suggestive of a gain-of-function mutation. 
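
A comparison of phenotype frequencies between two injection groups by Fisher's exact test can be sketched as below. This is a minimal, self-contained illustration and not the inventors' analysis: the embryo counts in the usage example are hypothetical placeholders, since the underlying counts are not given in the text, and the two-sided p-value is computed by the usual convention of summing the probabilities of all tables no more likely than the observed one.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Rows are injection groups; columns are embryos with / without the
    duplicated-axis phenotype.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x positives in row 1, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lo, hi = max(0, col1 - row2), min(row1, col1)
    p_obs = prob(a)
    # Sum over all tables at least as extreme (no more probable) as observed;
    # the small tolerance guards against floating-point ties.
    return sum(prob(x) for x in range(lo, hi + 1)
               if prob(x) <= p_obs * (1 + 1e-9))


# Hypothetical counts: 7 of 20 embryos with axis duplication in one group
# versus 2 of 20 in the other.
print(fisher_exact_two_sided(7, 13, 2, 18))
```

An exact test is a natural choice for this kind of scoring because the number of embryos per group is small and some cells of the table may contain only a few embryos.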

The role of Dkk-1 as a modulator of Zmax/LRP5- and HBM-mediated Wnt signaling 
was investigated. Literature reports have previously characterized Xenopus and murine 
Dkk-1 as antagonists of the canonical Wnt pathway in the Xenopus system (Glinka et al., 1998 
Nature 391: 357-62). Using the human Dkk-1 construct, a dose-response assay was 
performed to confirm that our construct was functional and to identify the optimal amount of 
RNA for microinjection. Using 250 pg/embryo of hDkk-1 RNA, over 90% (p<0.001) of the 
embryos were observed to display enlarged anterior structures (big heads) as anticipated from 
the published reports (Fig. 36). 

The mechanism of hDkk-1 modulation of Wnt signaling in the presence of 
Zmax/LRP5 or HBM was also investigated. Without any hDkk-1 present, it was confirmed 
that HBM + Wnt5a was a more potent activator of Wnt signaling than Zmax/LRP5 + Wnt5a 
(p<0.05). Interestingly, in the presence of hDkk-1 (250 pg), Zmax/LRP5-mediated Wnt 
signaling was repressed (p<0.05) but hDkk-1 was unable to repress HBM-mediated Wnt 
signaling (p<0.01) (Fig. 37). The specificity of this observation can be further addressed by 
investigating other members of the Dkk family, other Wnt genes, LRP6, additional 
Zmax/LRP5 mutants, and the peptide aptamers. 

Example 6 

Effects of exogenous Dkk and LRP5 on Wnt signaling in the TCF-luciferase Assay 
Wnt activity can be antagonized by many proteins including secreted Frizzled related 
proteins (SFRPs), Cerberus, Wnt Inhibitory Factor-1 and Dkk-1 (Krupnik et al., 1999). The 
Dkk family of proteins consists of Dkk-1-4 and Soggy, a Dkk-3-like protein. Dkk-1 and 
Dkk-4 have been shown to antagonize Wnt mediated Xenopus embryo development, whereas 
Dkk-2, Dkk-3, and Soggy do not. Unlike many of these proteins that antagonize Wnt activity 
by directly interacting with Wnt proteins, Dkk-1 acts by binding to two recently identified 
Wnt coreceptors, LRP5 and LRP6 (Mao et al., 2001; Bafico et al., 2001). The details of this 
interaction have been examined by the present inventors and Mao et al. using deletion 
constructs of LRP6, which demonstrated that EGF repeats 3 and 4 are important for Dkk-1 
interaction. Accordingly, the activities of two Dkk proteins, Dkk-1 and Dkk-2, were 
investigated with various Wnt members, LRP5, LRP6, and the mutant form of LRP5, 
designated HBM. The present invention explores whether there is any functional difference 
between LRP5 and HBM with regard to Dkk action on Wnt mediated signaling. Various 
reagents were developed, including Dkk-1 peptides, constrained LRP5 peptide aptamers, 
constrained Dkk-1 peptide aptamers and polyclonal antibodies to Dkk-1 (in Example 4 
above) to identify factors that mimic HBM mediated Wnt signaling. 

Methods. Various LRP5 constrained peptides were developed: specifically, four 
peptides that interact with the LBD of LRP5 (Fig. 38, constructs OST259-262 in Fig. 39) and 
three peptides that interact with the cytoplasmic domain of LRP5 (constructs 
OST266-OST268 in Fig. 39). In addition, two Dkk-1 peptides were developed: constructs OST264 
and OST265 in Fig. 39, corresponding to Dkk-1 amino acids 139-266 and 96-245, containing 
the smallest region of Dkk-1 that interacts with LRP5 (Fig. 40). The cDNA clones encoding 
the LRP5 LBD interacting peptides and the Dkk-1 peptides were subcloned into pcDNA3.1 
with the addition of a Kozak and signal sequence to target the peptide for secretion. The 
constructs encoding the three peptides interacting with the cytoplasmic domain of LRP5 were 
also subcloned into pcDNA3.1. However, these latter constructs do not contain a signal 
sequence. 

HOB-03-CE6 osteoblastic cells developed by Wyeth Ayerst (Philadelphia, PA) were 
seeded into 24-well plates at 150,000 cells per well in 1 ml of the growth media 
(D-MEM/F12 phenol red-free) containing 10% (v/v) heat-inactivated FBS, 1X penicillin 
streptomycin, and 1X Glutamax-1, and incubated overnight at 34°C. The following day, the 
cells were transfected using Lipofectamine 2000® (as described by the manufacturer, 
Invitrogen) in OptiMEM (Invitrogen) with 0.35 µg/well of LRP5, HBM, or control plasmid 
DNA (empty vector pcDNA3.1) and either Wnt1 or Wnt3a plasmid DNA. Similar 
experiments were performed with LRP6 plasmid DNA (0.35 µg/well) or a control pEDdpc4 
empty vector. Furthermore, each of these groups was then divided into three groups, those 
receiving 0.35 µg/well Dkk-1, Dkk-2, or pcDNA3.1 control DNA. All wells were 
transfected with 0.025 µg/well of CMV beta-galactosidase plasmid DNA and 0.35 µg/well 
16X TCF(AS)-luciferase reporter DNA (developed by Ramesh Bhat, Wyeth-Ayerst 
(Philadelphia, PA)). After 4 hours of incubation, the cells were rinsed and 1 ml of fresh 
growth media was added to each well. The cells were cultured overnight at 34°C, followed 
by a wash and a change of media. Cells were cultured for an additional 18-24 hours at 37°C. 
Cells were then lysed with 50 µl/well of 1X lysis buffer. The extracts were assayed for 
beta-galactosidase activity (Galacto Reaction Buffer Diluent & Light Emission Accelerator, 
Tropix) using 5 µl extract + 50 µl beta-galactosidase diluent and luciferase activity 
(Luciferase Assay Reagent, Promega) using 20 µl extract. 

U2OS human osteosarcoma cells were also utilized. U2OS cells (ATCC) were 
seeded into 96-well plates at 30,000 cells per well in 200 µl of the growth media (McCoy's 
5A) containing 10% (v/v) heat-inactivated FBS, 1X penicillin streptomycin, and 1X 
Glutamax-1, and incubated overnight at 37°C. The following day, the media was replaced 
with OptiMEM (Invitrogen) and cells were transfected using Lipofectamine 2000® (as 
described by the manufacturer, Invitrogen) with 0.005 µg/well of LRP5, HBM, LRP6 or 
control plasmid DNA (empty vector pcDNA3.1) and either Wnt1 (0.0025 µg/well) or Wnt3a 
(0.0025 µg/well) plasmid DNA. In addition, the 16x-(AS) TCF-TK-firefly-luciferase and 
control TK-renilla luciferase (Promega Corp.) were co-transfected at 0.3 µg/well and 0.06 
µg/well respectively in all experiments. Furthermore, each of these groups was then divided 
into different groups, those receiving 0.05 µg/well Dkk-1, Dkk-2, Dkk-3, Dkk1-Alkaline 
Phosphatase (AP), mutant Dkk-1 (C220A), Soggy or pcDNA3.1 control DNA. In other 
experiments, cells were co-transfected with 0.005 µg/well of LRP5, 0.0025 µg/well of Wnt1 
or Wnt3a (using 0.0025 µg/well of a control pcDNA3.1) with LRP5-interacting aptamers 
(0.05 µg/well). Cells were cultured for an additional 18-20 hours at 37°C. Culture medium 
was removed. Cells were then lysed with 100 µl/well of 1X Passive Lysis Buffer (PLB) of 
Dual Luciferase Reagent kit (DLR-kit-Promega Corp.). 20 µl of the lysates were combined 
with LARII reagent of DLR-kit and assayed for TCF-firefly luciferase signal in Top Count 
(Packard) instrument. After measuring the Firefly readings, 100 µl of the "Stop and Glo" 
reagent of DLR kit that contains a quencher and a substrate for renilla luciferase was added 
into each well. Immediately the renilla luciferase reading was measured using the Top Count 
(Packard) Instrument. The ratios of the TCF-firefly luciferase to control renilla readings 
were calculated for each well and the mean ratio of triplicate or more wells was expressed in 
all data. 
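
The normalization described above, in which the firefly reading of each well is divided by the renilla reading of the same well and the ratios of replicate wells are averaged, can be sketched as follows. The function name and the example readings are hypothetical and serve only to illustrate the arithmetic.

```python
from statistics import mean


def normalized_tcf_activity(wells):
    """Mean firefly/renilla ratio over replicate wells.

    wells: iterable of (firefly_reading, renilla_reading) pairs, one per well.
    """
    return mean(firefly / renilla for firefly, renilla in wells)


# Made-up triplicate readings for a single transfection condition.
triplicate = [(1200.0, 100.0), (1500.0, 120.0), (1100.0, 95.0)]
print(normalized_tcf_activity(triplicate))
```

Taking the ratio well by well before averaging corrects each TCF-firefly signal for the transfection efficiency and cell number of that particular well.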

Results. The results of these experiments demonstrate that Dkk-1, in the presence of 
Wnt1 and LRP5, significantly antagonized TCF-luciferase activity (Fig. 41). In marked 
contrast, Dkk-1 had no effect on HBM/Wnt1 mediated TCF-luciferase activity (Fig. 41). In 
similar experiments, Dkk-1 was also able to antagonize LRP5/Wnt3a but not HBM/Wnt3a 
mediated TCF-luciferase activity (Fig. 42). These results indicate that the HBM mutation 
renders Dkk-1 inactive as an antagonist of Wnt1 and Wnt3a signaling in HOB-03-CE6 
osteoblastic cells. In other experiments with Wnt1, Dkk-2 had no effect on LRP5 or HBM 
mediated TCF-luciferase activity (Fig. 41). In contrast, with either LRP5 or HBM in the 
presence of Wnt3a, Dkk-2 was able to antagonize the TCF-luciferase activity (Fig. 42). 
These latter results indicate that the HBM mutation has no effect on Dkk-2 action in the 
presence of Wnt3a. Experiments were also performed using the closely related LRP6 cDNA 
in HOB-03-CE6 cells. In these experiments, LRP6/Wnt1 and LRP6/Wnt3a mediated 
TCF-luciferase were regulated in the same manner as LRP5. Specifically, Dkk-1 antagonized 
LRP6/Wnt1 mediated TCF-luciferase activity, whereas Dkk-2 had no effect (Fig. 41). 
However, similar to the action of Dkk-2 with LRP5/Wnt3a, Dkk-2 was able to antagonize 
LRP6/Wnt3a mediated TCF-luciferase activity (Fig. 42). 

The results in the U2OS cells show a robust effect of the OST262 LRP5 peptide 
aptamer activation of Wnt signaling in the presence of Wnt3a (Fig. 43). These functional 
results are confirmed by the results shown below in Example 7 using LRP5 peptide aptamers 
in the Xenopus assay. Such results affirmatively demonstrate that the effects of small 
molecules on LRP5/LRP6/HBM signaling can be detected using the TCF-luciferase assay. 
These data demonstrate that there is a functional difference between LRP5 and HBM 
regarding the ability of Dkk-1 to antagonize Wnt1 and Wnt3a signaling. These data and 
previous data showing that Dkk-1 directly interacts with LRP5 suggest that the inability of 
Dkk-1 to antagonize HBM/Wnt signaling may in part contribute to the HBM phenotype. 
These experiments further demonstrate the ability to test various molecules (e.g., small 
molecules, aptamers, peptides, antibodies, LRP5 interacting proteins or Dkk-1 interacting 
proteins, and the like) for an LRP5 ligand that mimics HBM mediated Wnt signaling or factors 
that block Dkk-1 interaction with LRP5. 

This was the assay that was used to show the responsiveness of two HBM-like 
variants. See Figs. 29 and 30. The data also demonstrate that these variants are less 
susceptible to modulation by Dkk. 

Example 7 

Cell-Based Functional High-Throughput Assay 
To develop a high throughput assay, the TCF-luciferase assay described in Example 6 
was modified utilizing low level expression of endogenous LRP5/6 in U2OS and HEK293 
cells. However, HOB-03-CE6 cells and any other cells which show a differential response to 
Dkk depending on whether LRP5, LRP6 or HBM is expressed can also be used. Using U2OS 
(human osteosarcoma) and HEK293 (ATCC) cells, the TCF-luciferase and tk-Renilla reporter 
element constructs were co-transfected along with Wnt3a/Wnt1 and Dkk. Wnt3a alone, by using 
endogenous LRP5/6, was able to stimulate TCF reporter gene activation. When Dkk is 
co-transfected with Wnt3a/Wnt1 and reporters (TCF-luci and tk-Renilla), Dkk represses 
reporter element activity. In addition, the TCF-luci signal activated by Wnt3a/Wnt1 can be 
repressed by the addition of Dkk-enriched conditioned media to the cells containing 
Wnt3a/Wnt1 and reporters. The assay is further validated by the lack of TCF-reporter 
inhibition by a point mutant construct (C220A) of Dkk1. 
The Dkk-mediated repression of the reporter is dependent upon the concentration of 
transfected Dkk cDNA or on the amount of Dkk-conditioned media added. In addition, the 
Dkk-mediated reporter suppression can be altered by the co-transfection of LRP5, LRP6, and 
HBM cDNAs in the U2OS or HEK293 cells. In general, U2OS cells show greater sensitivity 
to Dkk-mediated reporter suppression than HEK293 cells. In U2OS cells, the 
transfection of LRP5/LRP6/HBM/HBM-like cDNA leads to moderate activation of TCF-luci 
in the absence of Wnt3a/Wnt1 transfection. This activation presumably utilizes the 
endogenous Wnts present in U2OS cells. Under this condition, Dkk1 can repress TCF-luci 
and shows a differential signal between LRP5 and HBM. By co-transfecting Wnt3a/Wnt1, 
there is a generalized increase in the TCF-luci signal in the assay. Further, one can detect 
Dkk-mediated differential repression of the reporter due to LRP5 and HBM cDNA 
expression as well as between LRP5 and LRP6 cDNA. The repression is maximal with 
LRP6, moderate with LRP5, and least with HBM cDNA expression. In addition, the assay 
can detect the functional impact of the LRP5 interacting peptide aptamers (Fig. 38), Dkk1 
interacting aptamers and binding domains of Dkk-1 (Fig. 40; OST264 and OST265 of Figs. 
39 and 44). 

Using this system with a suppressed Wnt-TCF signal due to the presence of both Dkk 
and Wnt3a, one can screen for compounds that could alter Dkk modulation of Wnt signaling, 
by looking for compounds that activate the TCF-luciferase reporter, and thereby relieve 
the Dkk-mediated repression of the Wnt pathway. Such compounds identified may 
potentially serve as HBM-mimetics and be useful, for example, as osteogenic therapeutics. 
Data generated from this high throughput screen are shown in Figs. 45-47. Fig. 45 
shows that Dkk1 represses Wnt3a-mediated signaling in U2OS bone cells. Fig. 46 
demonstrates the functional differences between LRP5, LRP6, and HBM. Dkk-1 represses 
LRP6 and LRP5 but has little or no effect on HBM-generated Wnt1 signaling in U2OS cells. 
Fig. 47 demonstrates the differential effects of various Dkk family members and modified 
Dkks, including Dkk-1, a mutated Dkk-1 (C220A), Dkk-1-AP (modified with alkaline 
phosphatase), Dkk-3, and Soggy. 
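
One way to score compounds in such a screen is to express each compound's reporter signal as the fraction of the Dkk-mediated repression that it relieves. The sketch below is an assumption about how such a readout could be summarized and is not a scoring scheme stated in this example; the function name and the example values are hypothetical.

```python
def percent_relief(signal_with_compound, signal_dkk_repressed, signal_unrepressed):
    """Percent of Dkk-mediated repression relieved by a test compound.

    signal_dkk_repressed: normalized reporter signal with Wnt3a + Dkk, no compound.
    signal_unrepressed:   normalized reporter signal with Wnt3a alone (no Dkk).
    Returns 0 when the compound has no effect and 100 when the signal is
    restored to the unrepressed level.
    """
    window = signal_unrepressed - signal_dkk_repressed
    if window <= 0:
        raise ValueError("no Dkk-mediated repression window in the controls")
    return 100.0 * (signal_with_compound - signal_dkk_repressed) / window


# Made-up normalized reporter values for illustration only.
print(percent_relief(60.0, 20.0, 100.0))
```

Normalizing to the window between the repressed and unrepressed controls makes values comparable between plates whose absolute reporter signals differ.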



Example 8 

Dkk/LRP5/6/HBM/HBM-like ELISA Assay 
A further method to investigate Dkk binding to LRP or HBM and HBM-like 
polypeptides is via ELISA assay. Two possible permutations of this assay are exemplified. 
LRP5 is immobilized to a solid surface, such as a tissue culture plate well. One skilled in the 
art will recognize that other supports such as a nylon or nitrocellulose membrane, a silicon 
chip, a glass slide, beads, etc. can be utilized. In this example, the form of LRP5 used is 
actually a fusion protein where the extracellular domain of LRP5 is fused to the Fc portion of 
human IgG. The LRP5-Fc fusion protein is produced in CHO cell extracts from stable cell 
lines. The LRP5-Fc fusion protein is immobilized on the solid surface via anti-human Fc 
antibody or by Protein A or Protein G-coated plates, for example. The plate is then washed 
to remove any non-bound protein. Conditioned media containing secreted Dkk protein or 
secreted Dkk-epitope tagged protein (or purified Dkk or purified Dkk-epitope tagged protein) 
is incubated in the wells and binding of Dkk to LRP is investigated using antibodies to either 
Dkk or to an epitope tag. Dkk-V5 epitope tagged protein would be detected using an alkaline 
phosphatase tagged anti-V5 antibody. 

Alternatively, the Dkk protein could be directly fused to a detection marker, such as 
alkaline phosphatase. Here the detection of the Dkk-LRP interaction can be directly 
investigated without subsequent antibody-based experiments. The bound Dkk is detected in 
an alkaline phosphatase assay. If the Dkk-alkaline phosphatase fusion protein is bound to the 
immobilized LRP5, alkaline phosphatase activity would be detected in a colorimetric 
readout. As a result, one can assay the ability of small molecule compounds to alter the 
binding of Dkk to LRP using this system. Compounds, when added with Dkk (or 
epitope-tagged Dkk) to each well of the plate, can be scored for their ability to modulate the 
interaction between Dkk and LRP based on the signal intensity of bound Dkk present in the 
well after a suitable incubation time and washing. The assay can be calibrated by doing cold 
competition experiments with unlabeled Dkk or with a second type of epitope-tagged Dkk. 
Any small molecule that is able to modulate the Dkk-LRP interaction may be a suitable 
therapeutic candidate, more preferably an osteogenic therapeutic candidate. 

Example 9 

Functional Evaluation of Peptide Aptamers in Xenopus 
The constrained peptide aptamer constructs OST258-263 (where 258 contains the 
signal sequence by itself and 263 contains an irrelevant constrained peptide) (Figs. 39 and 
44) were used to generate RNA substantially as described in Example 6, except the vector 
was linearized by restriction endonuclease digestion and RNA was generated using T7 RNA 
polymerase. 

Aptamer RNA was injected at 250 pg per blastomere using the protocol of Example 
6. Wnt signaling was activated, as visualized by embryo dorsalization (duplicated body axis) 
with aptamers 261 and, more strongly, 262. The results of this assay are shown in Figs. 48 
and 49. These results suggest that aptamers 261 and 262 are able to activate Wnt signaling 
possibly by binding to the LBD of LRP, thereby preventing the modulation of LRP-mediated 
signaling by Dkk. 

The aptamers of the present invention can serve as HBM-mimetics. In the Xenopus 
system they are able to induce Wnt signaling all by themselves. They may also serve as tools 
for rational drug design by enhancing the understanding of how peptides are able to interact 
with LRP and modulate Wnt signaling at the specific amino acid level. Thus, one would be 
able to design small molecules to mimic their effects as therapeutics. In addition, the 
aptamers identified as positives in this assay may be used as therapeutic molecules 
themselves. 

Example 10 
Homogeneous Assay 

An excellent method to investigate perturbations in protein-protein interactions is via 
Fluorescence Resonance Energy Transfer (FRET). FRET is a quantum mechanical process 
where a fluorescent molecule, the donor, transfers energy to an acceptor chromophore 
molecule which is in close proximity. This system has been successfully used in the 
literature to characterize the intermolecular interactions between LRP5 and Axin (Mao et al., 
Molec. Cell Biol 7: 801-9). There are many different fluorescent tags available for such 
studies and there are several ways to fluorescently tag the proteins of interest. For example, 
CFP (cyan fluorescent protein) and YFP (yellow fluorescent protein) can be used as donor 
and acceptor, respectively. Fusion proteins, with a donor and an acceptor, can be engineered, 
expressed, and purified. 

For instance, purified LRP protein, or portions or domains thereof, fused to CFP and 
purified Dkk protein, or portions or domains thereof that interact with Dkk or LRP 
respectively, fused to YFP can be generated and purified using standard approaches. If 
LRP-CFP and Dkk-YFP are in close proximity, the transfer of energy from CFP to YFP will 
result in a reduction of CFP emission and an increase in YFP emission. Energy is supplied 
with an excitation wavelength of 450 nm and the energy transfer is recorded at emission 
wavelengths of 480 nm and 570 nm. The ratio of YFP emission to CFP emission provides a 
gauge for changes in the interaction between LRP and Dkk. This system is amenable for 
screening small molecule compounds that may alter the Dkk-LRP protein-protein interaction. 
Compounds that disrupt the interaction would be identified by a decrease in the ratio of YFP 
emission to CFP emission. Such compounds that modulate the LRP-Dkk interaction would 
then be considered candidate HBM mimetic molecules. Further characterization of the 
compounds can be done using the TCF-luciferase or Xenopus embryo assays to elucidate the 
effects of the compounds on Wnt signaling. 
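
The ratiometric readout described above can be sketched as follows. This is an illustration only: the function names, the example intensities, and the 20% cutoff used to flag a decrease in the ratio are assumptions and are not values given in this example.

```python
def fret_ratio(yfp_emission, cfp_emission):
    """Ratio of acceptor (YFP, 570 nm) to donor (CFP, 480 nm) emission."""
    return yfp_emission / cfp_emission


def disrupts_interaction(ratio_with_compound, ratio_control, min_fractional_drop=0.2):
    """Flag a compound whose YFP/CFP ratio falls below the no-compound control.

    min_fractional_drop is an arbitrary illustrative cutoff (20% decrease).
    """
    return ratio_with_compound <= ratio_control * (1.0 - min_fractional_drop)


# Made-up emission intensities for a control well and a compound-treated well.
control = fret_ratio(yfp_emission=300.0, cfp_emission=150.0)
treated = fret_ratio(yfp_emission=180.0, cfp_emission=200.0)
print(control, treated, disrupts_interaction(treated, control))
```

Using the ratio of the two emissions rather than either intensity alone makes the readout less sensitive to well-to-well differences in the total amount of protein.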

While the above example describes a cell-free, solution-phase assay using purified 
components, a similar cell-based assay could also be performed. For example, LRP-CFP 
fusion protein can be expressed in cells. The Dkk-YFP fusion protein then could be added to 
the cells either as purified protein or as conditioned media. The interaction of LRP and Dkk 
is then monitored as described above. 

Example 11 

Identification of Variants of LRP5/Zmax1/HBM 

Because the YWTD repeats constitute the major part of the domain which interacts with 
Dkk-1, a search was begun from the repeats to look for protein folds that contain such 
repeats. Springer, 1998 J. Mol. Biol. 283: 837-62 proposed that these YWTD repeats, which 
had been previously described as "spacers" and considered to have no defined structure, are 
in fact the propeller-blade subdomains of highly structured six-bladed β-propellers. Springer 
(1998) therefore proposed two theoretically modeled protein structures with such YWTD 
repeats. 

The Model. A set of LRP5 propeller domain sequences from mouse and human were 
assembled and aligned using CLUSTALW, together with the sequences corresponding to two 
of the modeled YWTD-propeller structures of Springer (1998) [1lpx, based on the LRP1 
β-propeller domain no. 7 from chicken (SwissProt LRP1_CH7), and 1ndx, based on human 
nidogen (Swiss-Prot NIDO_HUl)]. The alignment was manually edited based on the 
rationale and knowledge of protein structures. Fig. 50 depicts a schematic model of LRP5. 
The secondary structures consist exclusively of β-strands and turns. The secondary structure 
assignments were verified by the predictive program, DSC (Discrimination of Protein 
Secondary Structure Class available at http://bioweb.pasteur.fr/seqanal/interfaces/dsc.html). 
Preliminary checks of the exon-intron boundaries were performed manually. A 
tertiary-structure homology model for human LRP5 using the 1lpx model as a template was built 
with Insight II (Accelrys Inc., San Diego, CA) and examined using the graphics display 
programs from Insight II and Rasmol (freely available at 
http://www.umass.edu/microbio/rasmol/index2.html). 

Results. Based on the homology alignments and modeling, the domain diagram of 
LRP5 in Fig. 50 was obtained. Fig. 51 shows the complete alignment and secondary 
structure assignments for the propeller domains of mouse LRP5 and LRP6, human LRP5 and 
human LRP6, and the sequences corresponding to the two theoretical models constructed by 
Springer (1998). 

To a good first approximation, the sequence alignments support the proposal that 
there are four 6-bladed β-propeller domains in human LRP5. This model nicely 
accommodates the sequence (as should be expected since it derives from a carefully 
constructed model of a chicken LRP propeller domain). The overall shape of this domain 
resembles a disk with inward-sloping sides and a hole down the middle. The polypeptide 
chain enters and leaves from the same (comparatively flat) "bottom" surface. As indicated in 
the alignment (Fig. 51) and in several of the structure illustrations, the G171V mutation falls 
into a loop on the outer or "top" face of the domain. This immediately suggests that, 
contrary to what one might expect given the role of highly conserved glycine residues in 
determining protein folds, the mutation should not have any significant effect on the 
domain's tertiary structural stability. It is rather more likely that the mutation interferes with 
binding of some ligand, possibly a macromolecule (Smith et al., 1999 Trends Biochem. Sci. 
24: 181-5) or alters a protein-protein interaction. 

Closer scrutiny of the protein in the vicinity of G171 shows that this residue sits at the 
bottom of a pocket on the outer edge of the top surface of the domain. Moreover, one side of 
this pocket consists of a cluster of very hydrophobic side chains, while the other side has 
more polar groups, most notably glutamate 172. In the mutant form of the protein, with 
V171 in place of the glycine, the pocket largely disappears as the isopropyl group of valine 
replaces the hydrogen side chain of glycine. It is easy to understand how such a substitution 
could seriously disrupt ligand binding if the ligand normally protrudes into the pocket. By 
the same rationale, any other mutations that block such pockets in any of the four propellers 
could also result in impaired ligand binding. 

The reasonableness of the proposed model is based on Springer's results as well as 
additional data. For example, protein domains of this kind (including the well-studied YWTD 
repeats) typically form robust and rigid tertiary structures. 
Loop conformations may vary somewhat, especially for the longer loops, but the basic 
features of the protein scaffold are almost certainly well predicted. Of course we would be 
on somewhat more secure ground if there were a crystal structure for one of these domains. 
That naturally implies that some of the more interesting features, such as the identity of 
residues exposed on the outer, putative ligand-binding surface, will also be well-predicted. 
Note that in Fig. 51 these residues are marked in red on the sequences corresponding to 
Springer's (1998) models. Even though different models could possibly be built that are not 
dependent on Springer's assumptions and lead to a different topology of the propeller, this 
would not alter the conclusion that the G171V mutation lies in a surface loop. 

At the very least, the model affords an opportunity to think about this protein in 
molecular terms, and that should facilitate both experimental design and evaluation of results, 
i.e., candidate epitopes might be selected more rationally. One interesting aspect, evoked 
forcefully by the diagram in Fig. 50, is the question of how the different modular propeller 
domains interact. The EGF-like domains are well-studied (dozens of crystal and NMR 
structures exist; see, for example, Bork et al., 1996 Quart. Rev. Biophys. 29: 119-67). The 
EGF-like domains consist mainly of a couple of β-hairpins, cross-linked by disulfide bonds, 
and in some cases they have tightly bound calcium ions, which must further stabilize their 
structure. Interestingly, in contrast to the YWTD propellers, the EGF-like domains have the 
polypeptide chain entering at one end and exiting at the other. Thus, mechanically, these 
domains could act as spacers, even swivels (since rotation can occur around the tails that 
extend from either end). Given the size of the propellers, there are almost certainly 
inter-domain interactions between them as they assemble in some higher-order structure, 
connected by the EGF-like domains. Because of the topology of the propellers, their "top" 
faces would have to face outward since the EGF-like domains will tie them together by their 
protruding N- and C-terminal extensions. 

A more sophisticated model of the β-propeller structure was built using X-ray 
crystallographic data from the LDL receptor. The primary amino acid sequences of the 
various beta propeller domains of LRP5 (and other domains) were used to develop homology 
models of their 3-dimensional structures according to the following method: 

(A) Search for suitable structural templates and check sequence identity 
with target. The ExNRL-3D database (derived from the Protein Data Bank, 
http://pdb.ccdc.cam.ac.uk/pdb/) was searched using BLASTP2 (Altschul et al., 1990 
J. Mol. Biol. 215: 403-10; Huang et al., 1991 Adv. Appl. Math. 12: 337-67; and 
Peitsch, 1995 PDB Quart. Newsletter 12: 4) to find all similarities of the target 
sequence with sequences of known structure. This program selects all templates with 
sequence identities above 25% and projected model size greater than 20 residues. 
This step also detects domains that can be modelled based on unrelated templates. 
Use of this step resulted in the selection of Protein Data Bank (PDB) structure 1IJQ 
("Crystal Structure Of The LDL Receptor YWTD-EGF Domain Pair") (Jeon et al., 
2001 Nat. Struct. Biol. 8: 499). 

(B) Create ProModII (Altschul et al., 1990; Huang et al., 1991; and 
Peitsch, 1995) jobs and generate models with ProModII. 

(C) Superimpose related 3D structures. This method is based upon the 
diagonals of sequence similarity (the dynamic sequence alignment algorithm 
(SIM)) (Altschul et al., 1990; Huang et al., 1991; and Peitsch, 1995). Specifically, the 
following steps are performed: 

(i) Primary match: Regions with sequence similarity are selected automatically 
(manual selection is also possible) and the corresponding residues matched in 
three-space. 

(ii) The primary match is further refined using expanding context spheres. 

(D) Generate a multiple alignment comprising the sequences to be modeled. 

(E) Generate a framework for the new sequence. 

(F) Based on the topological arrangement of corresponding atoms: (a) atoms 
which occupy a similar portion of space and are expected to have a structural 
counterpart in the new structure are used to compute the framework coordinates 
(averaged positions); (b) Side chains with fully incorrect geometries are removed. 

(G) Rebuild the lacking loops based on the geometry of the loop stems: (a) the 
stems of the loops to rebuild are used to scan a database of structural fragments 
derived from the Brookhaven Data Bank. Either the best fitting fragment or a 
framework derived from the five best fragments is used as the new loop, or (b) the 
conformational space is searched using a CSP approach (seven allowed angle 
combinations, space allocation for the loop, space allocation for each α-carbon in the 
loop). Both methods will place only α-carbons when loops are in steric conflict with 
the surrounding context. 

(H) Rebuild incomplete backbones based on the position of the α-carbons. The 
backbone is rebuilt using a set of seven allowed Φ-Ψ angle combinations and a 
database of backbone fragments: a sliding window of five residues is run through 
the protein sequence, and the best matching backbone fragment for each overlapping 
pentapeptide is stored. A framework for the main chain atoms is derived from 
these peptides, using only the coordinates of the three central residues of each 
pentapeptide. 

(I) The model's structural quality is verified and packing is checked as follows: 

(i) Verification of structural quality is based on the method described by 
Lüthy et al., 1992 Nature 356: 83-85. This method analyzes the 3D context 
of each residue and allows the identification of misfolded regions. 

(ii) Packing is checked based on a probe accessible surface (Connolly 
surface) computation and a cubic grid passed through the structure. Inside and 
outside surfaces are detected and the center and size of each cavity is 
computed. The algorithm then compares the size and distribution of these 
cavities between the model and the structure in order to detect possibly 
misfolded regions. 

(J) Refine structure by energy minimization and molecular dynamics based on 
force field computations following Gromos96 (Methods for the evaluation of 
long-range electrostatic forces in computer simulations of molecular systems, in 
COMPUTER SIMULATION OF BIOMOLECULAR SYSTEMS, THEORETICAL AND 
EXPERIMENTAL APPLICATIONS, Vol. 2, 182-212 (W.F. van Gunsteren et al., eds., 
Escom Science Publishers, Leiden, The Netherlands, 1993); van Gunsteren et al., 
1990 Angew. Chem. Int. Ed. Engl. 29: 992-1023; van Gunsteren et al., 1992 Eur. J. 
Biochem. 204: 947-961; Torda and van Gunsteren, Molecular Modeling Using 
Nuclear Magnetic Resonance Data, in REVIEWS IN COMPUTATIONAL CHEMISTRY, 
Vol. III, 143-172 (K. B. Lipkowitz et al., eds., VCH Publishers, Inc., New York, 
1992); van Gunsteren, Molecular dynamics and stochastic dynamics simulation: A 
primer, in COMPUTER SIMULATION OF BIOMOLECULAR SYSTEMS, THEORETICAL AND 
EXPERIMENTAL APPLICATIONS 3-36; van Gunsteren et al., Computation of free energy 
in practice: choice of approximations and accuracy limiting factors, in COMPUTER 
SIMULATION OF BIOMOLECULAR SYSTEMS, THEORETICAL AND EXPERIMENTAL 
APPLICATIONS 315-348; Smith et al., Methods for the evaluation of long-range 
electrostatic forces in computer simulations of molecular systems, in COMPUTER 
SIMULATION OF BIOMOLECULAR SYSTEMS, THEORETICAL AND EXPERIMENTAL 
APPLICATIONS 182-212; van Gunsteren et al., Accounting for Molecular Mobility in 
Structure Determination Based on Nuclear Magnetic Resonance Spectroscopic and 
X-Ray Diffraction Data, in METHODS IN ENZYMOLOGY: NUCLEAR MAGNETIC 
RESONANCE, Vol. 239, 619-654 (T.L. James et al., eds., Academic Press, New York, 
1994); van Gunsteren et al., 1994 Quart. Rev. Biophysics 27: 435-481; van 
Gunsteren et al., 1995 Biomolecular Modelling: Overview of Types of Methods to 
Search and Sample Conformational Space, in PROCEEDINGS OF THE 1ST EUROPEAN 
CONFERENCE ON COMPUTATIONAL CHEMISTRY, American Institute of Physics 
Conf. Proc. 330: 253-268; and van Gunsteren et al., 1995 Computer Phys. 
Communications 91: 305-319). 

(K) The structure is then manually refined based on sequence alignment and structural 
elements using DeepView, the SWISS-PDB Viewer (Guex et al., 1997 
Electrophoresis 18: 2714-23). 
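
The template selection criteria of step (A) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it takes a target/template pair that has already been aligned, counts identity over gap-free columns, and applies the two thresholds given above (sequence identity above 25% and projected model size greater than 20 residues). The function names, the "-" gap character and the toy sequences are hypothetical and are not part of the ExNRL-3D/BLASTP2 procedure itself.

```python
def aligned_pairs(target, template, gap="-"):
    """Return aligned residue pairs, skipping columns where either side has a gap."""
    if len(target) != len(template):
        raise ValueError("aligned sequences must have equal length")
    return [(a, b) for a, b in zip(target, template) if a != gap and b != gap]


def percent_identity(target, template, gap="-"):
    """Percent of gap-free aligned columns in which both residues are identical."""
    pairs = aligned_pairs(target, template, gap)
    if not pairs:
        return 0.0
    return 100.0 * sum(a == b for a, b in pairs) / len(pairs)


def is_candidate_template(target, template, min_identity=25.0, min_size=20):
    """Apply the two step (A) thresholds: identity above 25%, size above 20 residues."""
    size = len(aligned_pairs(target, template))
    return percent_identity(target, template) > min_identity and size > min_size


# Toy aligned strings (placeholders only, not LRP5 or LDL receptor sequence).
toy_target = "YWTD" * 6                  # 24 residues
toy_template = "YWTD" * 3 + "AAAA" * 3   # 12 of the 24 columns are identical

print(percent_identity(toy_target, toy_template))
print(is_candidate_template(toy_target, toy_template))
```

Treating the number of gap-free aligned columns as the "projected model size" is a simplifying assumption; the actual program may define that quantity differently.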

Uses of structural model. Having a three-dimensional model of the protein domain 
allows one to appreciate the context of the HBM mutation, or other HBM-like mutations, in 
three-dimensional space. This facilitates the prediction of possible mechanisms of action in a 
manner impossible in any other way, because one cannot predict from primary sequence alone 
the proximity of one distant (in sequence space) amino acid to another. This also allows one 
to make functional predictions that can then be tested using molecular/cellular biology 
techniques to further refine the model and validate the target. For example, the structural 
model of the G171V mutation predicted an alteration in side chain-side chain interactions, so 
we were able to use this information to successfully predict similar mutations in other amino 
acids that would have analogous structural effects. Overlaying other functional data on this 
model would also highlight other potentially important regions of the protein domain. 
Additionally, such models show the accessible residues, which would be useful for 
developing small compounds, antibodies and the like which recognize and bind to this 
region. 

Results. Based on the space filling hypothesis set forth in Section 3 above, two 
residues from blade 6 of propeller 1 were identified as being in structurally equivalent 
locations to residue 171 in blade 4. Accordingly, substitutions with valine were engineered: 
F241V and A242V. 

Residues that were predicted to be accessible to the interior surface of the propeller 
were identified in both blades 3 and 5 of Zmax1 (LRP5). Original predictions from the 
Springer model identified G199 on blade 5 and E128 on blade 3 as the equivalent position to 
residue 171 of blade 4 (HBM). Using the more sophisticated model described above, G199 
and E128 are not predicted to be in the equivalent position to residue 171 of blade 4. Based 
on the new, more sophisticated model, the following residues were chosen for valine 
substitutions: 

Blade No.    Mutations 
5            L200V, T201V and I202V 
3            S127V 

Another residue, T125, was selected for a conservative substitution (T125S) as well as a 
less conservative substitution (T125G). 

The role of the YWTD repetitive motifs in the β-propeller was examined by carrying 
out an alanine replacement scan over this region in Blade 4. These repeats are predicted to 
create the β-sheet blades of the propeller. Based on this model data, the following additional 
mutations were prepared in propeller 1: Y164A, M165A, Y166A, W167A, T168A, and 
D169A. These substitutions would be predicted to be detrimental to the β-propeller structure. 
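
As a small illustration of how the alanine scan mutation names above follow from the residue positions, the sketch below enumerates X-to-alanine substitutions over a segment. The helper name `alanine_scan` is hypothetical, and the segment "YMYWTD" starting at position 164 is inferred from the wild-type residues named in the six mutations listed above.

```python
def alanine_scan(segment, start):
    """Name the X->A substitution for each non-alanine residue in the segment.

    segment: one-letter amino acid codes of the region to scan
    start:   sequence position of the first residue in the segment
    """
    return [f"{aa}{start + i}A" for i, aa in enumerate(segment) if aa != "A"]


# Residues 164-169 of propeller 1, blade 4, as implied by the mutation names.
scan = alanine_scan("YMYWTD", 164)
print(scan)
```

Residues that are already alanine are skipped, since an A-to-A replacement would not change the sequence.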

The space-filling or occupied space model was further examined by mutating a 
residue predicted by the above methods to reside on the exterior surface of propeller 1, blade 
4. This mutation, K215V (K to V at position 215), was selected because it was believed to not 
produce an HBM-like effect. 

Based on this information, the following model of the HBM G171V mutation was 
obtained. Specifically, substitutions of valine at specific locations within propeller 1 could 
result in the creation of hydrophobic patches on the surface of the propeller. For example, 
the V at residue 171 of the HBM mutant comes in close proximity to L150 on the adjacent 
strand, potentially creating a hydrophobic patch that was not present with the wild type 
G171. An example of this prediction, and others, are presented in Fig. 52. 
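
To make the hydrophobic patch reasoning more concrete, the following sketch parses substitution names such as G171V and reports the change in side chain hydropathy. It relies on the Kyte-Doolittle hydropathy scale, which is an assumption introduced here for illustration and is not the method used to build the model described above. The function names are likewise hypothetical.

```python
import re

# Kyte-Doolittle hydropathy index (higher values are more hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def parse_mutation(name):
    """Split a name like 'G171V' into (wild-type residue, position, mutant residue)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", name)
    if match is None:
        raise ValueError(f"not a point substitution: {name!r}")
    wild_type, position, mutant = match.groups()
    return wild_type, int(position), mutant


def hydropathy_change(name):
    """Mutant minus wild-type hydropathy, rounded to one decimal place."""
    wild_type, _, mutant = parse_mutation(name)
    return round(KYTE_DOOLITTLE[mutant] - KYTE_DOOLITTLE[wild_type], 1)


for name in ("G171V", "A214V", "L150G", "I194G", "K215V"):
    print(name, parse_mutation(name), hydropathy_change(name))
```

A positive value means the substituted side chain is more hydrophobic on this scale. The scale alone says nothing about where a residue sits in the fold, which is why a surface-exposed substitution such as K215V still has to be judged against the three-dimensional model.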

Based on this model, a mutation of L150G, which would remove the hydrophobic side 
chain at this site, may not produce an HBM effect. In addition, a double mutant of G171V 
(HBM) and L150G may not produce an HBM effect due to the elimination of the 
hydrophobic side chain interactions. The model can also be used to test this prediction on a 
different propeller 1 blade. The substitution A214V was shown to result in an HBM effect. 
I194 is the residue that is in the equivalent position to A214 on blade 5 as L150 is to G171 on 
blade 4. While the mutation of I194G alone may not be predicted to result in an HBM effect, 
the double mutant of I194G and A214V may no longer permit a hydrophobic patch to form 
and, as a result, may not generate the HBM effect observed with A214V. Additional analyses 
of the residues in proximity to A214 that may contribute to the formation of hydrophobic 
patches identified P240 and F241. Mutations of either P240G or F241G together with 
A214V may no longer permit the formation of hydrophobic patches and may eliminate the 
HBM effect observed with A214V alone. 

Lastly, the new model was used to investigate specific aspects of the other three 
propellers. The relevance of residues R494, R570 and V667 in the three other propellers was 
investigated. These three residues were chosen since they have been reported to be the sites of 
osteoporosis-pseudoglioma disease-causing missense mutations: R494Q, R570W and V667M 
(Gong et al., 2001 Cell 107: 513-523). Based on the structural location of these residues on 
the outer surface of the propeller, the model would not predict that these substitutions are 
either gain- or loss-of-function substitutions (Fig. 53A-C). For example, the mutations 
related to osteoporosis-pseudoglioma syndrome (OPPG) result in patients who have very low 
bone mass and are prone to developing fractures and deformation. This genetic 
condition is now known to result from mutations in LRP5 (Gong et al., 2001 Cell 107: 513- 
23). Based on this information, this mutation would be categorized as a loss-of-function 
mutation. 

As a result, we have engineered these receptor variants for functional testing. These new 
receptor variants were generated exactly as described above with the following 
oligonucleotides: 

Oligonucleotide    Sequence 
L150GF    GAAGGTGCTCTTCTGGCAGGACGGTGACCAGCCGAGGGCC 
L150GR    GGCCCTCGGCTGGTCACCGTCCTGCCAGAAGAGCACCTTC 
I194GF    GATCATTGTGGACTCGGACGGTTACTGGCCCAATGGACTG 
I194GR    CAGTCCATTGGGCCAGTAACCGTCCGAGTCCACAATGATC 
L200VF    CATTTACTGGCCCAATGGAGTGACCATCGACCTGGAGGAGCAGAAGC 
L200VR    GCTTCTGCTCCTCCAGGTCGATGGTCACTCCATTGGGCCAGTAAATG 
T201VF    CATTTACTGGCCCAATGGACTGGTCATCGACCTGGAGGAGCAGAAGC 
T201VR    GCTTCTGCTCCTCCAGGTCGATGACCAGTCCATTGGGCCAGTAAATG 
I202VF    CATTTACTGGCCCAATGGACTGACCGTCGACCTGGAGGAGCAGAAGC 
I202VR    GCTTCTGCTCCTCCAGGTCGACGGTCAGTCCATTGGGCCAGTAAATG 
T125SF    GGCAAGAAGCTGTACTGGTCGGACTCAGAGACCAACCGCATC 
T125SR    GATGCGGTTGGTCTCTGAGTCCGACCAGTACAGCTTCTTGCC 
T125GF    GGCAAGAAGCTGTACTGGGGGGACTCAGAGACCAACCGCATC 
T125GR    GATGCGGTTGGTCTCTGAGTCCCCCCAGTACAGCTTCTTGCC 
S127VF    GAAGCTGTACTGGACGGACGTAGAGACCAACCGCATCGAGGTG 
S127VR    CACCTCGATGCGGTTGGTCTCTACGTCCGTCCAGTACAGCTTC 
F241VF    CCTGACGCACCCCGTCGCCCTGACGCTCTCCGGGGACACTC 
F241VR    GAGTGTCCCCGGAGAGCGTCAGGGCGACGGGGTGCGTCAGG 
A242VF    CCTGACGCACCCCTTCGTCCTGACGCTCTCCGGGGACACTC 
A242VR    GAGTGTCCCCGGAGAGCGTCAGGACGAAGGGGTGCGTCAGG 
K215VF    CTACTGGGCTGACGCCGTGCTCAGCTTCATCCACCGTGCC 
K215VR    GGCACGGTGGATGAAGCTGAGCACGGCGTCAGCCCAGTAG 
Y164AF    CTTGGACCCCGCTCACGGGGCCATGTACTGGACAGACTGG 
Y164AR    CCAGTCTGTCCAGTACATGGCCCCGTGAGCGGGGTCCAAG 
M165AF    GGACCCCGCTCACGGGTACGCGTACTGGACAGACTGGGGTG 
M165AR    CACCCCAGTCTGTCCAGTACGCGTACCCGTGAGCGGGGTCC 
Y166AF    CCCGCTCACGGGTACATGGCCTGGACAGACTGGGGTGAGAC 
Y166AR    GTCTCACCCCAGTCTGTCCAGGCCATGTACCCGTGAGCGGG 
W167AF    CGCTCACGGGTACATGTACGCGACAGACTGGGGTGAGACGC 
W167AR    GCGTCTCACCCCAGTCTGTCGCGTACATGTACCCGTGAGCG 
T168AF    GCTCACGGGTACATGTACTGGGCAGACTGGGGTGAGACGCC 
T168AR    GGCGTCTCACCCCAGTCTGCCCAGTACATGTACCCGTGAGC 
D169AF    CACGGGTACATGTACTGGACAGCCTGGGGTGAGACGCCCCG 
D169AR    CGGGGCGTCTCACCCCAGGCTGTCCAGTACATGTACCCGTG 
R494QF    CAACTTGGATGGGCAGGAGCAGCGTGTGCTGGTCAATGCCTC 
R494QR    GAGGCATTGACCAGCACACGCTGCTCCTGCCCATCCAAGTTG 
R570WF    GCAGCGCCGCAGCATCGAGTGGGTGCACAAGGTCAAGGCCAG 
R570WR    CTGGCCTTGACCTTGTGCACCCACTCGATGCTGCGGCGCTGC 
V667MF    CGAGACCAATAACAACGACATGGCCATCCCGCTCACGGGCG 
V667MR    CGCCCGTGAGCGGGATGGCCATGTCGTTGTTATTGGTCTCG 
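
The oligonucleotides above come in forward (F) and reverse (R) pairs. As a hedged sketch of two simple consistency checks one might run on such a pair, the Python below tests that the reverse oligonucleotide is the reverse complement of the forward one, and locates the bases at which a forward oligonucleotide departs from the wild-type sequence. The L150G pair is copied from the list above; the wild-type fragment is the matching 40-nucleotide stretch of the Exon 2 listing in Fig. 3A. The function and variable names are illustrative assumptions.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def mismatch_positions(oligo, wild_type):
    """0-based positions at which an oligonucleotide differs from wild type."""
    if len(oligo) != len(wild_type):
        raise ValueError("sequences must have equal length")
    return [i for i, (o, w) in enumerate(zip(oligo, wild_type)) if o != w]


L150GF = "GAAGGTGCTCTTCTGGCAGGACGGTGACCAGCCGAGGGCC"
L150GR = "GGCCCTCGGCTGGTCACCGTCCTGCCAGAAGAGCACCTTC"
# Wild-type stretch taken from the Exon 2 listing (Fig. 3A).
WILD_TYPE_EXON2_FRAGMENT = "GAAGGTGCTCTTCTGGCAGGACCTTGACCAGCCGAGGGCC"

pair_is_complementary = reverse_complement(L150GF) == L150GR
changed = mismatch_positions(L150GF, WILD_TYPE_EXON2_FRAGMENT)
print(pair_is_complementary, changed)
print(WILD_TYPE_EXON2_FRAGMENT[22:25], "->", L150GF[22:25])
```

For this pair the two changed bases fall within a single codon, CTT to GGT, which under the standard genetic code corresponds to the leucine-to-glycine change named by L150G.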



Point mutations in other molecules with beta-propellers have been described in the 
literature. For example, alpha4-integrin (Guerrero-Esteo et al., 1998 FEBS Lett. 429: 123- 
8) has had two mutations introduced, which resulted in altered protein function (i.e., G130R 
and G190S). Both of these glycines are in structurally equivalent locations to G171 of LRP5, 
just located on different blades of the beta-propeller at the upper surface. Using the model 
discussed above, we mapped these residues of the alpha4-integrin and confirmed their 
placement. The result of these substitutions in the alpha4 integrin was loss of ligand binding 
and reduced affinity to its heterodimer binding partner, integrin-beta1, such that the 
alpha4-beta1 (α4β1) heterodimer cannot be formed. Examples of modulation of other residues in 
integrin-alpha4 with functional consequences are referenced within Guerrero-Esteo et al. 
(1998). Thus, these data support the notion that disruption of beta-propeller structure can 
result in significant and dramatic effects on receptor function. 

All references cited are herein incorporated by reference in their entirety for all 
purposes. The following applications are also incorporated by reference in their entirety 
herein: U.S. Application Nos. 09/543,771 and 09/544,398 filed on April 5, 2000, which are a 
continuation-in-part of Application No. 09/229,319, filed January 13, 1999, which claims 
benefit of U.S. Provisional Application No. 60/071,449, filed January 13, 1998, and U.S. 
Provisional Application No. 60/105,511, filed October 23, 1998. Additionally, this 
application claims priority of Application Nos. 60/290,071 filed May 11, 2001; 60/291,311 
filed May 17, 2001; 60/353,058 filed February 1, 2002; and 60/361,293 filed March 4, 2002, 
the texts of which are herein incorporated by reference in their entirety for all purposes. 

CLAIMS 

What is claimed is: 

1. A nucleic acid comprising a mutation in LRP5 or LRP6 which results in a 
HBM-like phenotype when expressed in a cell, wherein said HBM-like phenotype results in 
bone mass modulation and/or lipid level modulation. 

2. A LRP5 nucleic acid comprising at least one mutation of Tables 2 or 3 or 
which results in a mutation of one of the following G171V, A214V, A65V, M282V, G171K, 
G171F, G171I, G171Q, L200V, T201V, I202V, or S127V when expressed in a cell, and 
wherein expression of the nucleic acid in a subject results in bone mass modulation and/or 
lipid level modulation. 

3. A LRP6 nucleic acid comprising at least one of Tables 2 or 3 or which results 
in a mutation of one of the following G171V, A214V, A65V, M282V, G171K, G171F, 
G171I, G171Q, L200V, T201V, I202V, or S127V when expressed in a cell, and wherein said 
mutation occurs in LRP6 in a position equivalent to LRP5 such that expression of the nucleic 
acid in a subject results in bone mass modulation and/or lipid level modulation. 

4. A LRP5 nucleic acid encoding at least one amino acid mutation in propeller 1. 

5. The LRP5 nucleic acid of claim 4, wherein said nucleic acid encodes for at 
least one mutation selected from the group consisting of: G171V, A214V, A65V, M282V, 
G171K, G171F, G171I, G171Q, L200V, T201V, I202V, and S127V. 

6. A LRP6 nucleic acid encoding at least one amino acid mutation in propeller 1. 

7. The LRP6 nucleic acid of claim 6, wherein said nucleic acid encodes for at 
least one mutation in an equivalent position of LRP6 selected from the group of LRP5 
mutations consisting of: G171V, A214V, A65V, M282V, G171K, G171F, G171I, G171Q, 
L200V, T201V, I202V, and S127V. 

8. The nucleic acid of claims 1-7, wherein the mutation is G171V, A214V, 
A65V, M282V, G171K, G171F, G171I or G171Q in LRP5 or in an equivalent domain in 
LRP6. 

WO 02/092000 PCT/US02/14877 

9. A polypeptide encoded by a nucleic acid of claims 1 to 8, wherein said 
polypeptide when expressed in a cell modulates Wnt signaling, Dkk activity, LRP5 activity 
and/or LRP6 activity. 

10. The polypeptide of claim 9, wherein said polypeptide modulates bone mass 
and/or lipid levels when expressed in a subject. 

11. A vector comprising a nucleic acid of claims 1 to 8. 

12. A cell comprising the vector of claim 11. 

13. The cell of claim 12, wherein said cell is a cancer cell, a liver cell or a bone 
cell. 

14. A polypeptide or biologically active fragment thereof derived from LRP5 
comprising at least one amino acid change of Table 2, G171V, A214V, A65V, M282V, 
G171K, G171F, G171I, G171Q, L200V, T201V, I202V, or S127V. 

15. The polypeptide or biologically active fragment of claim 14, wherein said 
protein or biologically active fragment comprises at least one amino acid change of G171V, 
A214V, A65V, M282V, G171K, G171F, G171I, G171Q, L200V, T201V, I202V, or S127V. 

16. A polypeptide or biologically active fragment thereof derived from LRP6 
comprising at least one amino acid change of Table 2, G171V, A214V, A65V, M282V, 
G171K, G171F, G171I, G171Q, L200V, T201V, I202V, or S127V when the amino acid 
change is expressed in an equivalent position in LRP6. 

17. The polypeptide or biologically active fragment of claim 16, wherein said 
protein or biologically active fragment comprises at least one amino acid change of G171V, 
A214V, A65V, M282V, G171K, G171F, G171I, G171Q, L200V, T201V, I202V, or S127V. 

18. A polypeptide or biologically active fragment which when expressed in a 
subject has a HBM-like phenotype comprising at least one amino acid mutation in propeller 1 
with the proviso that if the mutation is G171V that a second mutation which maintains the 
HBM-like phenotype is also present. 

19. The polypeptide or biologically active fragment of claim 18, wherein said 
polypeptide is derived from LRP5, LRP6 or HBM. 

20. The polypeptide or biologically active fragment of claims 14-19, wherein the 
amino acid change is G171V, A214V, A65V, M282V, G171K, G171F, G171I or G171Q of 
LRP5 or an equivalent position in LRP6. 

21. An antibody or immunogenic fragment thereof which binds to a polypeptide 
or antigenic fragment thereof of claims 14 to 20. 

22. The antibody or immunogenic fragment thereof of claim 21, wherein the 
antibody is a monoclonal antibody, a chimeric antibody, a bispecific antibody, a humanized 
antibody, a primatized® antibody, a human antibody, or a labeled antibody. 

23. An antibody which binds to a polypeptide comprising 
208 KLYWADAKLSFIHRAN 223, 277 ALYSPMDIQVLSQER 291, 61 GLEDAAAVDFQFSKGA 73, 
234 EGSLTHPFALTLSG 247, 249 TLYWTDWQTRSIHACN 264, 144 VLFWQDLDQPRAI 156, 
194 IYWPNGLTIDLEEQKLY 210, 34 LLLFANRRDVRLVD 47, 
75 GAVYWTDVSEEAIKQ 89, 121 KLYWTDSETNRIEVA 135 of LRP5 or an equivalent domain 
on LRP6 or variants thereof. 

24. An antibody which binds to a polypeptide comprising 
969 LILPLHGLRNVK^ 989 KFIYWVDGRQNIKRAKDDGTQPFVL 1013, 
1009 QPFVLTSLSQGQNPDRQPHDLSIDI 1033, 1029 LSIDIYSRTLFWTCEA1OT 
1049 NVHRLSGEAMGVVL^ 1253 CGEPPTCSPDQFAC 1266, 
1278 WRCDGFPECDDQSDEEGC 1295, 1316 RCDGEADCQDRSDEADC 1332, 
1370 CEITKPPSDDSPAH 1383 of LRP5 or an equivalent domain on LRP6 or variants thereof, 
wherein said antibody modulates binding of LRP5, HBM, LRP6 or variants thereof to Dkk. 

25. A method of diagnosing a HBM-like phenotype in a subject comprising 

(A) obtaining a biological sample from the subject; 

(B) exposing the sample to an antibody or an immunogenic fragment of claims 
21-24; and 

(C) detecting whether the antibody bound a protein from the biological sample 
from the subject to determine whether the subject has a HBM-like phenotype. 

26. A composition for modulating bone mass and/or lipid levels in a subject 
comprising a therapeutically effective amount of an antibody of claims 21-24 and a 
pharmaceutically acceptable carrier. 

27. An antibody or an immunogenic fragment thereof of claims 21-24, wherein 
the antibody or immunogenic fragment thereof can (1) discriminate between LRP5 and an 
HBM-like protein, (2) discriminate between LRP6 and an HBM-like protein, or (3) 
discriminate between HBM and an HBM-like protein. 

28. A transgenic animal having somatic and/or germ cells comprising a nucleic 
acid which comprises a promoter region that directs protein expression in animal and/or 
human cells operably linked to a nucleic acid of claims 1 to 8, and wherein said transgenic 
animal has at least three bone parameters modulated by the expression of said nucleic acid. 

29. The transgenic animal of claim 28, wherein the promoter region is selected 
from the group consisting of CMV, RSV, SV40, EF-1α, CMVβActin, histone, type I 
collagen, TGFβ1, SX2, cfos/cjun, Cbfa1, Fra/Jun, Dlx5, osteocalcin, osteopontin, bone 
sialoprotein, and collagenase promoter regions. 

30. A transgenic animal having somatic and/or germ cells comprising a nucleic 
acid which comprises a sequence which encodes a polypeptide of claims 9 or 10, and 
wherein the nucleic acid further comprises an operably linked promoter region that directs 
protein expression in animal and/or human cells, and wherein the transgenic animal has at 
least three bone parameters modulated by the expression of said nucleic acid. 

31. An animal embryo comprising a nucleic acid comprising a promoter region 
that directs protein expression in animal and/or human cells operably linked to a nucleic 
acid of claims 1 to 8. 

32. An animal or human cell transfected with a nucleic acid which comprises a 
promoter region that directs protein expression in animal and/or human cells operably 
linked to a nucleic acid of claims 1 to 8. 

33. The transgenic animal of claim 28, wherein a human HBM-like protein is 
expressed and passes the human HBM-like gene to its offspring. 

34. A transgenic animal produced from the transgenic animal of claim 33 or its 
offspring. 

35. An animal model for the study of bone density modulation and/or lipid level 
modulation comprising a first group of animals composed of the transgenic animal of claim 
28 and a second group of control animals. 

36. The animal model of claim 35, wherein the group of transgenic animals have 
cells which comprise a nucleic acid encoding a human HBM-like protein or biologically 
active polypeptide fragment thereof. 

37. An animal model for the study of bone density modulation comprising a first 
group of animals composed of the transgenic animals of claim 30 and a second group of 
control animals. 

38. A method of identifying agents which modulate the activity of an HBM-like 
nucleic acid comprising: 

(a) transfecting a cell with a vector of claim 11; 

(b) exposing the transfected cell of step (a) to a compound; and 

(c) determining whether the compound modulates the activity of the HBM-like 
nucleic acid. 

39. A method of identifying agents which modulate the activity of an HBM-like 
protein comprising: 

(a) transfecting a cell with a vector of claim 11; 

(b) exposing the transfected cell of step (a) to a compound; and 

(c) determining whether the compound modulates the activity of the HBM-like 
protein. 

40. The method of claim 39, wherein the compound is a hormone, a growth factor, 
a peptide, RNA, shRNA, siRNA, DNA, a mineral, a vitamin, a natural product, or a synthetic 
organic compound. 

41. A method for identifying compounds which modulate the interaction of Dkk 
with the Wnt signaling pathway comprising: 

(a) transfecting cells with constructs containing a nucleic acid of claims 1 to 8; 

(b) assessing changes in expression of a reporter element linked to a Wnt- 
responsive promoter; and 

(c) identifying as a Dkk/Wnt interaction modulating compound any compound 
which alters reporter gene expression compared with cells transfected with a Dkk construct 
alone. 

42. The method according to Claim 41, wherein the cells are cancer cells, liver 
cells or bone cells. 

43. The method according to Claim 42, wherein the cells are U2-OS, 
HOB-03-CE6, or HEK293 cells. 

44. The method according to Claim 41, wherein the reporter element used is 
TCF-luciferase, tk-Renilla, or a combination thereof. 

45. A method of diagnosing a subject as expressing a nucleic acid comprising a 
nucleotide change of Tables 2 or 3, the method comprising the steps of: 

(A) obtaining a biological sample from the subject; and 

(B) assaying for the presence of the nucleotide change which results in HBM 
phenotype. 

46. A method for identifying agents which modulate LRP5, LRP6, or HBM 
comprising the steps of: 

(A) providing cells according to claim 32; 

(B) exposing the cells to a test compound; and 

(C) measuring the expression of LRP5, LRP6 or HBM respectively. 

47. The method for identifying agents of claim 42, further comprising a step of 
determining whether the agent further modulates bone mass and/or lipid levels. 

48. An agent identified by the method of claims 46 or 47. 

Exon 1 

ACTAAAGCGCCGCCGCCGCGCCATGGAGCCCGAGTGAGCGCGGCGCG 
GGCCCGTCCGGCCGCCGGACAACATGGAGGCAGCGCCGCCCGGGCCG 
CCGTGGCCGCTGCTGCTGCTGCTGCTGCTGCTGCTGGCGCTGTGCGGC 
TGCCCGGCCCCCGCCGCGGCC 

Exon 2 Coordinates: 527dl2_Contig308G 30944-30549 

gccccacagCCTCGCCGCTCCTGCTATTTGCCAACCGCCGGGACGTACGGC 

TGGTGGACGCCGGCGGAGTCAAGCTGGAGTCCACCATCGTGGTCAGC 

GGCCTGGAGGATGCGGCCGCAGTGGACTTCCAGTTTTCCAAGGGAGC 

CGTGTACTGGACAGACGTGAGCGAGGAGGCCATCAAGCAGACCTACCT 

GAACCAGACGGGGGCCGCCGTGCAGAACGTGGTCATCTCCGGCCTGG 

TCTCTCCCGACGGCCTCGCCTGCGACTGGGTGGGCAAGAAGCTGTACT 

GGACGGACTCAGAGACCAACCGCATCGAGGTGGCCAACCTCAATGGC 

ACATCCCGGAAGGTGCTCTTCTGGCAGGACCTTGACCAGCCGAGGGCC 

ATCGCCTTGGACCCCGCTCACGGgtaaaccctgctg 

... 9408 nt ... 

Exon 3 Coordinates: 527dl2_Contig308G 21141-20945 

ccccgtcacagGTACATGTACTGGACAGACTGGGGTGAGACGCCCCGGATTG 

AGCGGGCAGGGATGGATGGCAGCACCCGGAAGATCATTGTGGACTCG 

GACATTTACTGGCCCAATGGACTGACCATCGACCTGGAGGAGCAGAAG 

CTCTACTGGGCTGACGCCAAGCTCAGCTTCATCCACCGTGCCAACCTG 

GACGGCTCGTTCCGgtaggtacccac 

... 6094 nt ... 

Exon 4 Coordinates: 527dl2_Contig308G 15047-14850 

tccctgactgcagGCAGAAGGTGGTGGAGGGCAGCCTGACGCACCCCTTCGCC 

CTGACGCTCTCCGGGGACACTCTGTACTGGACAGACTGGCAGACCCGC 

TCCATCCATGCCTGCAACAAGCGCACTGGGGGGAAGAGGAAGGAGAT 

CCTGAGTGCCCTATACTCACCCATGGACATCCAGGTGCTGAGCCAGGA 

GCGGCAGCCTTTCTgtgagtgccgg 

... 1827 nt... 

Exon 5 Coordinates: 527dl2_Contig308G 13220-13088 

tttctcagTCCACACTCGCTGTGAGGAGGACAATGGCGGCTGCTCCCACCTG 

TGCCTGCTGTCCCCAAGCGAGCCTTTCTACACATGCGCCTGCCCCACG 

GGTGTGCAGCTGCAGGACAACGGCAGGACGTGTAAGGCAGgtgaggcggtgg 

gacg 



FIG. 3A 
... 20923 nt ... 

Exon 6 Coordinates: 527dl2_Contig309G 7705-8100 

ctccacagGAGCCGAGGAGGTGCTGCTGCTGGCCCGGCGGACGGACCTAC 

GGAGGATCTCGCTGGACACGCCGGACTTCACCGACATCGTGCTGCAGG 

TGGACGACATCCGGCACGCCATTGCCATCGACTACGACCCGCTAGAGG 

GCTATGTCTACTGGACAGATGACGAGGTGCGGGCCATCCGCAGGGCG 

TACCTGGACGGGTCTGGGGCGCAGACGCTGGTCAACACCGAGATCAA 

CGACCCCGATGGCATCGCGGTCGACTGGGTGGCCCGAAACCTCTACTG 

GACCGACACGGGCACGGACCGCATCGAGGTGACGCGCCTCAACGGCA 

CCTCCCGCAAGATCCTGGTGTCGGAGGACCTGGACGAGCCCCGAGCC 

ATCGCACTGCACCCCGTGATGGGgtaagacgggc 

3211 nt 

Exon 7 Coordinates: 527dl2_Contig309G 11311-11482 

ttcttctccagCCTCATGTACTGGACAGACTGGGGAGAGAACCCTAAAATCGA 
GTGTGCCAACTTGGATGGGCAGGAGCGGCGTGTGCTGGTCAATGCCTC 
CCTCGGGTGGCCCAACGGCCTGGCCCTGGACCTGCAGGAGGGGAAGC 
TCTACTGGGGAGACGCCAAGACAGACAAGATCGAGgtgaggctcctgtgg 

13445 nt 

Exon 8 Coordinates: 527dl2_Contig309G 24927-25143 

ccgtcctgcagGTGATCAATGTTGATGGGACGAAGAGGCGGACCCTCCTGGA 

GGACAAGCTCCCGCACATTTTCGGGTTCACGCTGCTGGGGGACTTCAT 

CTACTGGACTGACTGGCAGCGCCGCAGCATCGAGCGGGTGCACAAGG 

TCAAGGCCAGCCGGGACGTCATCATTGACCAGCTGCCCGACCTGATGG 

GGCTCAAAGCTGTGAATGTGGCCAAGGTCGTCGgtgagtccggggggtc 

... 2826 nt ...

Exon 9 Coordinates: 527dl2_Contig309G 27969-28256 

gttcgcttccagGAACCAACCCGTGTGCGGACAGGAACGGGGGGTGCAGCCA 

CCTGTGCTTCTTCACACCCCACGCAACCCGGTGTGGCTGCCCCATCGG 

CCTGGAGCTGCTGAGTGACATGAAGACCTGCATCGTGCCTGAGGCCTT 

CTTGGTCTTCACCAGCAGAGCCGCCATCCACAGGATCTCCCTCGAGAC 

CAATAACAACGACGTGGCCATCCCGCTCACGGGCGTCAAGGAGGCCTC 

AGCCCTGGACTTTGATGTGTCCAACAACCACATCTACTGGACAGACGT 

CAGCCTGAAGgtagcgtgggc 

... 3102 nt ...

FIG. 3B 
Exon 10 Coordinates: 527dl2_Contig309G 31358-31582

cctgctgccagACCATCAGCCGCGCCTTCATGAACGGGAGCTCGGTGGAGCA 

CGTGGTGGAGTTTGGCCTTGACTACCCCGAGGGCATGGCCGTTGACTG 

GATGGGCAAGAACCTCTACTGGGCCGACACTGGGACCAACAGAATCGA 

AGTGGCGCGGCTGGACGGGCAGTTCCGGCAAGTCCTCGTGTGGAGGG 

ACTTGGACAACCCGAGGTCGCTGGCCCTGGATCCCACCAAGGGgtaagtgtt 

tgcctgtc 

... 1297 nt ...

Exon 11 Coordinates: 527dl2_Contig309G 32879-33064 

gtgccttccagCTACATCTACTGGACCGAGTGGGGCGGCAAGCCGAGGATCG 
TGCGGGCCTTCATGGACGGGACCAACTGCATGACGCTGGTGGACAAG 
GTGGGCCGGGCCAACGACCTCACCATTGACTACGCTGACCAGCGCCTC 
TACTGGACCGACCTGGACACCAACATGATCGAGTCGTCCAACATGCTG 

Ggtgagggccgggct 

2069 nt 

Exon 12 Coordinates: 527dl2_Contig309G 35133-35454 

gtgttcatgcagGTCAGGAGCGGGTCGTGATTGCCGACGATCTCCCGCACCCG 

TTCGGTCTGACGCAGTACAGCGATTATATCTACTGGACAGACTGGAAT 

CTGCACAGCATTGAGCGGGCCGACAAGACTAGCGGCCGGAACCGCAC 

CCTCATCCAGGGCCACCTGGACTTCGTGATGGACATCCTGGTGTTCCA 

CTCCTCCCGCCAGGATGGCCTCAATGACTGTATGCACAACAACGGGCA 

GTGTGGGCAGCTGTGCCTTGCCATCCCCGGCGGCCACCGCTGCGGCT 

GCGCCTCACACTACACCCTGGACCCCAGCAGCCGCAACTGCAGCCgtaag 

tgcctcatggt 

2006 nt 

Exon 13 Coordinates: 527dl2_Contig309G 37460-37659 

gcctcctctaCGCCCACCACCTTCTTGCTGTTCAGCCAGAAATCTGCCATCAG 

TCGGATGATCCCGGACGACCAGCACAGCCCGGATCTCATCCTGCCCCT 

GCATGGACTGAGGAACGTCAAAGCCATCGACTATGACCCACTGGACAA 

GTTCATCTACTGGGTGGATGGGCGCCAGAACATCAAGCGAGCCAAGGA 

CGACGGGACCCAGgcaggtgccctgtgg 

6965 nt 



FIG. 3C 
Exon 14 Coordinates: 527dl2_Contig309G 44624-44832

ctttgtcttacagCCCTTTGTTTTGACCTCTCTGAGCCAAGGCCAAAACCCAGAC 

AGGCAGCCCCACGACCTCAGCATCGACATCTACAGCCGGACACTGTTC 

TGGACGTGCGAGGCCACCAATACCATCAACGTCCACAGGCTGAGCGG 

GGAAGCCATGGGGGTGGTGCTGCGTGGGGACCGCGACAAGCCCAGGG 

CCATCGTCGTCAACGCGGAGCGAGGgtaggaggccaac 

1404 nt 

Exon 15 Coordinates: 527dl2_Contig309G 46236-46427 

ccaccctcccgcagGTACCTGTACTTCACCAACATGCAGGACCGGGCAGCCAA 
GATCGAACGCGCAGCCCTGGACGGCACCGAGCGCGAGGTCCTCTTCA 
CCACCGGCCTCATCCGCCCTGTGGCCCTGGTGGTGGACAACACACTGG 
GCAAGCTGTTCTGGGTGGACGCGGACCTGAAGCGCATTGAGAGCTGT 

GACCTGTCAGgtacgcgccccgg 

686 nt 

Exon 16 Coordinates: 527dl2_Contig309G 47113-47322 

ggctgcttgcagGGGCCAACCGCCTGACCCTGGAGGACGCCAACATCGTGCA 

GCCTCTGGGCCTGACCATCCTTGGCAAGCATCTCTACTGGATCGACCG 

CCAGCAGCAGATGATCGAGCGTGTGGAGAAGACCACCGGGGACAAGC 

GGACTCGCATCCAGGGCCGTGTCGCCCACCTCACTGGCATCCATGCAG 

TGGAGGAAGTCAGCCTGGAGGAGTTCTgtacgtgggggc 

3884 nt 

Exon 17 Coordinates: 527dl2_Contig309G 51206-51331 

ttgtctttgcagCAGCCCACCCATGTGCCCGTGACAATGGTGGCTGCTCCCACA 
TCTGTATTGCCAAGGGTGATGGGACACCACGGTGCTCATGCCCAGTCC 
ACCTCGTGCTCCTGCAGAACCTGCTGACCTGTGGAGgtaggtgtgacctaggtgc 

... 3905 nt ...

Exon 18 Coordinates: 527dl2_Contig309G 55236-55472 

gttctcctctgtccctcccccagAGCCGCCCACCTGCTCCCCGGACCAGTTTGCATGT 

GCCACAGGGGAGATCGACTGTATCCCCGGGGCCTGGCGCTGTGACGG 

CTTTCCCGAGTGCGATGACCAGAGCGACGAGGAGGGCTGCCCCGTGT 

GCTCCGCCGCCCAGTTCCCCTGCGCGCGGGGTCAGTGTGTGGACCTGC 

GCCTGCGCTGCGACGGCGAGGCAGACTGTCAGGACCGCTCAGACGAG 

GTGGACTGTGACGgtgaggccctcc 

3052 nt 

FIG. 3D 
Exon 19 Coordinates: 527dl2_Contig309G 58524-58634

tctccttgcagCCATCTGCCTGCCCAACCAGTTCCGGTGTGCGAGCGGCCAGT 

GTGTCCTCATCAAACAGCAGTGCGACTCCTTCCCCGACTGTATCGACG 

GCTCCGACGAGCTCATGTGTGgtgagccagctt 

1448 nt 



Exon 20 Coordinates: 527dl2_Contig309G 60082-60319 

gtttgtctctggcagAAATCACCAAGCCGCCCTCAGACGACAGCCCGGCCCACA 

GCAGTGCCATCGGGCCCGTCATTGGCATCATCCTCTCTCTCTTCGTCAT 

GGGTGGTGTCTATTTTGTGTGCCAGCGCGTGGTGTGCCAGCGCTATGC 

GGGGGCCAACGGGCCCTTCCCGCACGAGTATGTCAGCGGGACCCCGC 

ACGTGCCCCTCAATTTCATAGCCCCGGGCGGTTCCCAGCATGGCCCCT 

TCACAGgtaaggagcctgagatatggaa 



... 1095 nt ...

Exon 21 Coordinates: 527dl2_Contig309G 61414-61552 

cttccctgccagGCATCGCATGCGGAAAGTCCATGATGAGCTCCGTGAGCCTG 
ATGGGGGGCCGGGGCGGGGTGCCCCTCTACGACCGGAACCACGTCAC 
AGGGGCCTCGTCCAGCAGCTCGTCCAGCACGAAGGCCACGCTGTACCC 

GCCGgtgaggggcggg 



... 6513 nt ...

Exon 22 Coordinates: 527dl2_Contig309G 68065-68162 

ttggctctcctcagATCCTGAACCCGCCGCCCTCCCCGGCCACGGACCCCTCCC 
TGTACAACATGGACATGTTCTACTCTTCAAACATTCCGGCCACTGCGA 
GACCGTACAGgtaggacatcccctgcag 



... 2273 nt ...

FIG. 3E 
Exon 23 Coordinates: 527dl2_Contig309G 70435-70901

tcaaacattccggccactgcgagaccgtacagGCCCTACATCATTCGAGGAATGGCGCCCC 

CGACGACGCCCTGCAGCACCGACGTGTGTGACAGCGACTACAGCGCC 

AGCCGCTGGAAGGCCAGCAAGTACTACCTGGATTTGAACTCGGACTCA 

GACCCCTATCCACCCCCACCCACGCCCCACAGCCAGTACCTGTCGGCG 

GAGGACAGCTGCCCGCCCTCGCCCGCCACCGAGAGGAGCTACTTCCAT 

CTCTTCCCGCCCCCTCCGTCCCCCTGCACGGACTCATCCTGACCTCGGC

CGGGCCACTCTGGCTTCTCTGTGCCCCTGTAAATAGTTTTAAATATGAACAA 

AGAAAAAAATATATTTTATGATTTAAAAAATAAATATAATTGGGATTTTAA 

AAACATGAGAAATGTGAACTGTGATGGGGTGGGCAGGGCTGGGAGAACTT 

TGTACAGTGGAGAAATATTTATAAACTTAATTTTGTAAAACA 



FIG. 3F 
HBM: Constructs for transgenic mice

Confirmation of expression 
Transient transfection into HOB-02-02 cells* 
*Fold increase compared to Zmax1
in cells transfected with empty vector



FIG. 17 



SUBSTITUTE SHEET (RULE 26)

WO 02/092000    34/89    PCT/US02/14877

Comparison of Human & Mouse TaqMan 
Primer/Probe Sets for Zmax1/HBM 
HBM: In vivo pDXA - transgenic mice
HBMMCBA construct 
HBM transgenic mice: in vivo pDXA*
BMD % changes vs WT in 5 week old animals

Line   n    Femur   Spine   Total   Tibia mRNA

HBMMCBA
2      11   21      24      10      x7-10
13     4    11      27      5       x1-2
18     1    8       9       3       x10-11
21     1    10      12      6       0
28     15   0       30      4       x1
30     7    4       32      4       0

HBMMTIC
19     5    63      70      41      x7-8
35     1    4       47      6       x1

* CV: Femur (2.7%); Spine (6.4%); Total (1.7%)



FIG. 22 



HBM transgenic mice: in vivo pDXA* 
BMD % changes vs WT in 9 week old animals
FIG. 24A

HBMGI_2AS

TCTAGACTCG AGGCGGCCGC CCATTGTGCA CTAAAGCGCC GCCGCCGCGC CATGGAGCCC
GAGTGAGCGC GGCGCGGGCC CGTCCGGCCG CCGGACAACA TGGAGGCAGC GCCGCCCGGG
CCGCCGTGGC CGCTGCTGCT GCTGCTGCTG CTGCTGCTGG CGCTGTGCGG CTGCCCGGCC
CCCGCCGCGG CCTCGCCGCT CCTGCTATTT GCCAACCGCC GGGACGTACG GCTGGTGGAC
GCCGGCGGAG TCAAGCTGGA GTCCACCATC GTGGTCAGCG GCCTGGAGGA TGCGGCCGCA
GTGGACTTCC AGTTTTCCAA GGGAGCCGTG TACTGGACAG ACGTGAGCGA GGAGGCCATC
AAGCAGACCT ACCTGAACCA GACGGGGGCC GCCGTGCAGA ACGTGGTCAT CTCCGGCCTG
GTCTCTCCCG ACGGCCTCGC CTGCGACTGG GTGGGCAAGA AGCTGTACTG GACGGACTCA
GAGACCAACC GCATCGAGGT GGCCAACCTC AATGGCACAT CCCGGAAGGT GCTCTTCTGG
CAGGACCTTG ACCAGCCGAG GGCCATCGCC TTGGACCCCG CTCACGGGTA CATGTACTGG
ACAGACTGGG TTGAGACGCC CCGGATTGAG CGGGCAGGGA TGGATGGCAG CACCCGGAAG
ATCATTGTGG ACTCGGACAT TTACTGGCCC AATGGACTGA CCATCGACCT GGAGGAGCAG
AAGCTCTACT GGGCTGACGC CAAGCTCAGC TTCATCCACC GTGCCAACCT GGACGGCTCG
TTCCGGCAGA AGGTGGTGGA GGGCAGCCTG ACGCACCCCT TCGCCCTGAC GCTCTCCGGG
GACACTCTGT ACTGGACAGA CTGGCAGACC CGCTCCATCC ATGCCTGCAA CAAGCGCACT
GGGGGGAAGA GGAAGGAGAT CCTGAGTGCC CTCTACTCAC CCATGGACAT CCAGGTGCTG
AGCCAGGAGC GGCAGCCTTT CTTCCACACT CGCTGTGAGG AGGACAATGG CGGCTGCTCC
CACCTGTGCC TGCTGTCCCC AAGCGAGCCT TTCTACACAT GCGCCTGCCC CACGGGTGTG
CAGCTGCAGG ACAACGGCAG GACGTGTAAG GCAGGAGCCG AGGAGGTGCT GCTGCTGGCC
CGGCGGACGG ACCTACGGAG GATCTCGCTG GACACGCCGG ACTTCACCGA CATCGTGCTG
CAGGTGGACG ACATCCGGCA CGCCATTGCC ATCGACTACG ACCCGCTAGA GGGCTATGTC
TACTGGACAG ATGACGAGGT GCGGGCCATC CGCAGGGCGT ACCTGGACGG GTCTGGGGCG
CAGACGCTGG TCAACACCGA GATCAACGAC CCCGATGGCA TCGCGGTCGA CTGGGTGGCC
CGAAACCTCT ACTGGACCGA CACGGGCACG GACCGCATCG AGGTGACGCG CCTCAACGGC
FIG. 24B

HBMGI_2AS 

ACCTCCCGCA AGATCCTGGT GTCGGAGGAC CTGGACGAGC CCCGAGCCAT CGCACTGCAC 
CCCGTGATGG GCCTCATGTA CTGGACAGAC TGGGGAGAGA ACCCTAAAAT CGAGTGTGCC 
AACTTGGATG GGCAGGAGCG GCGTGTGCTG GTCAATGCCT CCCTCGGGTG GCCCAACGGC 
CTGGCCCTGG ACCTGCAGGA GGGGAAGCTC TACTGGGGAG ACGCCAAGAC AGACAAGATC 
GAGGTGATCA ATGTTGATGG GACGAAGAGG CGGACCCTCC TGGAGGACAA GCTCCCGCAC 
ATTTTCGGGT TCACGCTGCT GGGGGACTTC ATCTACTGGA CTGACTGGCA GCGCCGCAGC 
ATCGAGCGGG TGCACAAGGT CAAGGCCAGC CGGGACGTCA TCATTGACCA GCTGCCCGAC 
CTGATGGGGC TCAAAGCTGT GAATGTGGCC AAGGTCGTCG GAACCAACCC GTGTGCGGAC 
AGGAACGGGG GGTGCAGCCA CCTGTGCTTC TTCACACCCC ACGCAACCCG GTGTGGCTGC 
CCCATCGGCC TGGAGCTGCT GAGTGACATG AAGACCTGCA TCGTGCCTGA GGCCTTCTTG 
GTCTTCACCA GCAGAGCCGC CATCCACAGG ATCTCCCTCG AGACCAATAA CAACGACGTG 
GCCATCCCGC TCACGGGCGT CAAGGAGGCC TCAGCCCTGG ACTTTGATGT GTCCAACAAC 
CACATCTACT GGACAGACGT CAGCCTGAAG ACCATCAGCC GCGCCTTCAT GAACGGGAGC 
TCGGTGGAGC ACGTGGTGGA GTTTGGCCTT GACTACCCCG AGGGCATGGC CGTTGACTGG 
ATGGGCAAGA ACCTCTACTG GGCCGACACT GGGACCAACA GAATCGAAGT GGCGCGGCTG 
GACGGGCAGT TCCGGCAAGT CCTCGTGTGG AGGGACTTGG ACAACCCGAG GTCGCTGGCC 
CTGGATCCCA CCAAGGGCTA CATCTACTGG ACCGAGTGGG GCGGCAAGCC GAGGATCGTG 
CGGGCCTTCA TGGACGGGAC CAACTGCATG ACGCTGGTGG ACAAGGTGGG CCGGGCCAAC 
GACCTCACCA TTGACTACGC TGACCAGCGC CTCTACTGGA CCGACCTGGA CACCAACATG 
ATCGAGTCGT CCAACATGCT GGGTCAGGAG CGGGTCGTGA TTGCCGACGA TCTCCCGCAC 
CCGTTCGGTC TGACGCAGTA CAGCGATTAT ATCTACTGGA CAGACTGGAA TCTGCACAGC 
ATTGAGCGGG CCGACAAGAC TAGCGGCCGG AACCGCACCC TCATCCAGGG CCACCTGGAC 
TTCGTGATGG ACATCCTGGT GTTCCACTCC TCCCGCCAGG ATGGCCTCAA TGACTGTATG 
FIG. 24C

HBMGI_2AS 

CACAACAACG GGCAGTGTGG GCAGCTGTGC CTTGCCATCC CCGGCGGCCA CCGCTGCGGC 
TGCGCCTCAC ACTACACCCT GGACCCCAGC AGCCGCAACT GCAGCCCGCC CACCACCTTC 
TTGCTGTTCA GCCAGAAATC TGCCATCAGT CGGATGATCC CGGACGACCA GCACAGCCCG 
GATCTCATCC TGCCCCTGCA TGGACTGAGG AACGTCAAAG CCATCGACTA TGACCCACTG 
GACAAGTTCA TCTACTGGGT GGATGGGCGC CAGAACATCA AGCGAGCCAA GGACGACGGG 
ACCCAGCCCT TTGTTTTGAC CTCTCTGAGC CAAGGCCAAA ACCCAGACAG GCAGCCCCAC 
GACCTCAGCA TCGACATCTA CAGCCGGACA CTGTTCTGGA CGTGCGAGGC CACCAATACC 
ATCAACGTCC ACAGGCTGAG CGGGGAAGCC ATGGGGGTGG TGCTGCGTGG GGACCGCGAC 
AAGCCCAGGG CCATCGTCGT CAACGCGGAG CGAGGGTACC TGTACTTCAC CAACATGCAG 
GACCGGGCAG CCAAGATCGA ACGCGCAGCC CTGGACGGCA CCGAGCGCGA GGTCCTCTTC 
ACCACCGGCC TCATCCGCCC TGTGGCCCTG GTGGTAGACA ACACACTGGG CAAGCTGTTC 
TGGGTGGACG CGGACCTGAA GCGCATTGAG AGCTGTGACC TGTCAGGGGC CAACCGCCTG 
ACCCTGGAGG ACGCCAACAT CGTGCAGCCT CTGGGCCTGA CCATCCTTGG CAAGCATCTC 
TACTGGATCG ACCGCCAGCA GCAGATGATC GAGCGTGTGG AGAAGACCAC CGGGGACAAG 
CGGACTCGCA TCCAGGGCCG TGTCGCCCAC CTCACTGGCA TCCATGCAGT GGAGGAAGTC 
AGCCTGGAGG AGTTCTCAGC CCACCCATGT GCCCGTGACA ATGGTGGCTG CTCCCACATC 
TGTATTGCCA AGGGTGATGG GACACCACGG TGCTCATGCC CAGTCCACCT CGTGCTCCTG 
CAGAACCTGC TGACCTGTGG AGAGCCGCCC ACCTGCTCCC CGGACCAGTT TGCATGTGCC 
ACAGGGGAGA TCGACTGTAT CCCCGGGGCC TGGCGCTGTG ACGGCTTTCC CGAGTGCGAT 
GACCAGAGCG ACGAGGAGGG CTGCCCCGTG TGCTCCGCCG CCCAGTTCCC CTGCGCGCGG 
GGTCAGTGTG TGGACCTGCG CCTGCGCTGC GACGGCGAGG CAGACTGTCA GGACCGCTCA 
GACGAGGCGG ACTGTGACGC CATCTGCCTG CCCAACCAGT TCCGGTGTGC GAGCGGCCAG 
TGTGTCCTCA TCAAACAGCA GTGCGACTCC TTCCCCGACT GTATCGACGG CTCCGACGAG 
CTCATGTGTG AAATCACCAA GCCGCCCTCA GACGACAGCC CGGCCCACAG CAGTGCCATC 
FIG. 24D

HBMGI_2AS

GGGCCCGTCA TTGGCATCAT CCTCTCTCTC TTCGTCATGG GTGGTGTCTA TTTTGTGTGC
CAGCGCGTGG TGTGCCAGCG CTATGCGGGG GCCAACGGGC CCTTCCCGCA CGAGTATGTC
AGCGGGACCC CGCACGTGCC CCTCAATTTC ATAGCCCCGG GCGGTTCCCA GCATGGCCCC
TTCACAGGCA TCGCATGCGG AAAGTCCATG ATGAGCTCCG TGAGCCTGAT GGGGGGCCGG
GGCGGGGTGC CCCTCTACGA CCGGAACCAC GTCACAGGGG CCTCGTCCAG CAGCTCGTCC
AGCACGAAGG CCACGCTGTA CCCGCCGATC CTGAACCCGC CGCCCTCCCC GGCCACGGAC
CCCTCCCTGT ACAACATGGA CATGTTCTAC TCTTCAAACA TTCCGGCCAC TGCGAGACCG
TACAGGCCCT ACATCATTCG AGGAATGGCG CCCCCGACGA CGCCCTGCAG CACCGACGTG
TGTGACAGCG ACTACAGCGC CAGCCGCTGG AAGGCCAGCA AGTACTACCT GGATTTGAAC
TCGGACTCAG ACCCCTATCC ACCCCCACCC ACGCCCCACA GCCAGTACCT GTCGGCGGAG
GACAGCTGCC CGCCCTCGCC CGCCACCGAG AGGAGCTACT TCCATCTCTT CCCGCCCCCT
CCGTCCCCCT GCACGGACTC ATCCTGACCT CGGCCGGGCC ACTCTGGCTT CTCTGTGtCC
CTGTAAATAG TTTTAAATAT GAACAAAGAA AAAAATATAT TTTATGATTT AAAAAATAAA
TATAATTGGG ATTTTAAAAA CATGAGAAAT GTGAACTGTG ATGGGGTGGG CAGGGCTGGG
AGAACTTTGT ACAGTGGAGA AATATTTATA AACTTAATTT TGTAAAACAG AAAAAAAAAA
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA GCGGCCGC
FIG. 25A

ZMAXGI_3AS

TCTAGACTCG AGGCGGCCGC CCATTGTGCA CTAAAGCGCC GCCGCCGCGC CATGGAGCCC
GAGTGAGCGC GGCGCGGGCC CGTCCGGCCG CCGGACAACA TGGAGGCAGC GCCGCCCGGG
CCGCCGTGGC CGCTGCTGCT GCTGCTGCTG CTGCTGCTGG CGCTGTGCGG CTGCCCGGCC
CCCGCCGCGG CCTCGCCGCT CCTGCTATTT GCCAACCGCC GGGACGTACG GCTGGTGGAC
GCCGGCGGAG TCAAGCTGGA GTCCACCATC GTGGTCAGCG GCCTGGAGGA TGCGGCCGCA
GTGGACTTCC AGTTTTCCAA GGGAGCCGTG TACTGGACAG ACGTGAGCGA GGAGGCCATC
AAGCAGACCT ACCTGAACCA GACGGGGGCC GCCGTGCAGA ACGTGGTCAT CTCCGGCCTG
GTCTCTCCCG ACGGCCTCGC CTGCGACTGG GTGGGCAAGA AGCTGTACTG GACGGACTCA
GAGACCAACC GCATCGAGGT GGCCAACCTC AATGGCACAT CCCGGAAGGT GCTCTTCTGG
CAGGACCTTG ACCAGCCGAG GGCCATCGCC TTGGACCCCG CTCACGGGTA CATGTACTGG
ACAGACTGGG GTGAGACGCC CCGGATTGAG CGGGCAGGGA TGGATGGCAG CACCCGGAAG
ATCATTGTGG ACTCGGACAT TTACTGGCCC AATGGACTGA CCATCGACCT GGAGGAGCAG
AAGCTCTACT GGGCTGACGC CAAGCTCAGC TTCATCCACC GTGCCAACCT GGACGGCTCG
TTCCGGCAGA AGGTGGTGGA GGGCAGCCTG ACGCACCCCT TCGCCCTGAC GCTCTCCGGG
GACACTCTGT ACTGGACAGA CTGGCAGACC CGCTCCATCC ATGCCTGCAA CAAGCGCACT
GGGGGGAAGA GGAAGGAGAT CCTGAGTGCC CTCTACTCAC CCATGGACAT CCAGGTGCTG
AGCCAGGAGC GGCAGCCTTT CTTCCACACT CGCTGTGAGG AGGACAATGG CGGCTGCTCC
CACCTGTGCC TGCTGTCCCC AAGCGAGCCT TTCTACACAT GCGCCTGCCC CACGGGTGTG
CAGCTGCAGG ACAACGGCAG GACGTGTAAG GCAGGAGCCG AGGAGGTGCT GCTGCTGGCC
CGGCGGACGG ACCTACGGAG GATCTCGCTG GACACGCCGG ACTTCACCGA CATCGTGCTG
CAGGTGGACG ACATCCGGCA CGCCATTGCC ATCGACTACG ACCCGCTAGA GGGCTATGTC
TACTGGACAG ATGACGAGGT GCGGGCCATC CGCAGGGCGT ACCTGGACGG GTCTGGGGCG
CAGACGCTGG TCAACACCGA GATCAACGAC CCCGATGGCA TCGCGGTCGA CTGGGTGGCC
CGAAACCTCT ACTGGACCGA CACGGGCACG GACCGCATCG AGGTGACGCG CCTCAACGGC
FIG. 25B

ZMAXGI_3AS

ACCTCCCGCA AGATCCTGGT GTCGGAGGAC CTGGACGAGC CCCGAGCCAT CGCACTGCAC 
CCCGTGATGG GCCTCATGTA CTGGACAGAC TGGGGAGAGA ACCCTAAAAT CGAGTGTGCC 
AACTTGGATG GGCAGGAGCG GCGTGTGCTG GTCAATGCCT CCCTCGGGTG GCCCAACGGC 
CTGGCCCTGG ACCTGCAGGA GGGGAAGCTC TACTGGGGAG ACGCCAAGAC AGACAAGATC 
GAGGTGATCA ATGTTGATGG GACGAAGAGG CGGACCCTCC TGGAGGACAA GCTCCCGCAC 
ATTTTCGGGT TCACGCTGCT GGGGGACTTC ATCTACTGGA CTGACTGGCA GCGCCGCAGC 
ATCGAGCGGG TGCACAAGGT CAAGGCCAGC CGGGACGTCA TCATTGACCA GCTGCCCGAC 
CTGATGGGGC TCAAAGCTGT GAATGTGGCC AAGGTCGTCG GAACCAACCC GTGTGCGGAC 
AGGAACGGGG GGTGCAGCCA CCTGTGCTTC TTCACACCCC ACGCAACCCG GTGTGGCTGC 
CCCATCGGCC TGGAGCTGCT GAGTGACATG AAGACCTGCA TCGTGCCTGA GGCCTTCTTG 
GTCTTCACCA GCAGAGCCGC CATCCACAGG ATCTCCCTCG AGACCAATAA CAACGACGTG 
GCCATCCCGC TCACGGGCGT CAAGGAGGCC TCAGCCCTGG ACTTTGATGT GTCCAACAAC 
CACATCTACT GGACAGACGT CAGCCTGAAG ACCATCAGCC GCGCCTTCAT GAACGGGAGC 
TCGGTGGAGC ACGTGGTGGA GTTTGGCCTT GACTACCCCG AGGGCATGGC CGTTGACTGG 
ATGGGCAAGA ACCTCTACTG GGCCGACACT GGGACCAACA GAATCGAAGT GGCGCGGCTG 
GACGGGCAGT TCCGGCAAGT CCTCGTGTGG AGGGACTTGG ACAACCCGAG GTCGCTGGCC 
CTGGATCCCA CCAAGGGCTA CATCTACTGG ACCGAGTGGG GCGGCAAGCC GAGGATCGTG 
CGGGCCTTCA TGGACGGGAC CAACTGCATG ACGCTGGTGG ACAAGGTGGG CCGGGCCAAC 
GACCTCACCA TTGACTACGC TGACCAGCGC CTCTACTGGA CCGACCTGGA CACCAACATG 
ATCGAGTCGT CCAACATGCT GGGTCAGGAG CGGGTCGTGA TTGCCGACGA TCTCCCGCAC 
CCGTTCGGTC TGACGCAGTA CAGCGATTAT ATCTACTGGA CAGACTGGAA TCTGCACAGC 
ATTGAGCGGG CCGACAAGAC TAGCGGCCGG AACCGCACCC TCATCCAGGG CCACCTGGAC 
TTCGTGATGG ACATCCTGGT GTTCCACTCC TCCCGCCAGG ATGGCCTCAA TGACTGTATG 
FIG. 25C

ZMAXGI_3AS 

CACAACAACG GGCAGTGTGG GCAGCTGTGC CTTGCCATCC CCGGCGGCCA CCGCTGCGGC 
TGCGCCTCAC ACTACACCCT GGACCCCAGC AGCCGCAACT GCAGCCCGCC CACCACCTTC 
TTGCTGTTCA GCCAGAAATC TGCCATCAGT CGGATGATCC CGGACGACCA GCACAGCCCG 
GATCTCATCC TGCCCCTGCA TGGACTGAGG AACGTCAAAG CCATCGACTA TGACCCACTG 
GACAAGTTCA TCTACTGGGT GGATGGGCGC CAGAACATCA AGCGAGCCAA GGACGACGGG 
ACCCAGCCCT TTGTTTTGAC CTCTCTGAGC CAAGGCCAAA ACCCAGACAG GCAGCCCCAC 
GACCTCAGCA TCGACATCTA CAGCCGGACA CTGTTCTGGA CGTGCGAGGC CACCAATACC 
ATCAACGTCC ACAGGCTGAG CGGGGAAGCC ATGGGGGTGG TGCTGCGTGG GGACCGCGAC 
AAGCCCAGGG CCATCGTCGT CAACGCGGAG CGAGGGTACC TGTACTTCAC CAACATGCAG 
GACCGGGCAG CCAAGATCGA ACGCGCAGCC CTGGACGGCA CCGAGCGCGA GGTCCTCTTC 
ACCACCGGCC TCATCCGCCC TGTGGCCCTG GTGGTAGACA ACACACTGGG CAAGCTGTTC 
TGGGTGGACG CGGACCTGAA GCGCATTGAG AGCTGTGACC TGTCAGGGGC CAACCGCCTG 
ACCCTGGAGG ACGCCAACAT CGTGCAGCCT CTGGGCCTGA CCATCCTTGG CAAGCATCTC 
TACTGGATCG ACCGCCAGCA GCAGATGATC GAGCGTGTGG AGAAGACCAC CGGGGACAAG 
CGGACTCGCA TCCAGGGCCG TGTCGCCCAC CTCACTGGCA TCCATGCAGT GGAGGAAGTC 
AGCCTGGAGG AGTTCTCAGC CCACCCATGT GCCCGTGACA ATGGTGGCTG CTCCCACATC 
TGTATTGCCA AGGGTGATGG GACACCACGG TGCTCATGCC CAGTCCACCT CGTGCTCCTG 
CAGAACCTGC TGACCTGTGG AGAGCCGCCC ACCTGCTCCC CGGACCAGTT TGCATGTGCC 
ACAGGGGAGA TCGACTGTAT CCCCGGGGCC TGGCGCTGTG ACGGCTTTCC CGAGTGCGAT 
GACCAGAGCG ACGAGGAGGG CTGCCCCGTG TGCTCCGCCG CCCAGTTCCC CTGCGCGCGG
GGTCAGTGTG TGGACCTGCG CCTGCGCTGC GACGGCGAGG CAGACTGTCA GGACCGCTCA
GACGAGGCGG ACTGTGACGC CATCTGCCTG CCCAACCAGT TCCGGTGTGC GAGCGGCCAG
TGTGTCCTCA TCAAACAGCA GTGCGACTCC TTCCCCGACT GTATCGACGG CTCCGACGAG
CTCATGTGTG AAATCACCAA GCCGCCCTCA GACGACAGCC CGGCCCACAG CAGTGCCATC
FIG. 25D

ZMAXGI_3AS 

GGGCCCGTCA TTGGCATCAT CCTCTCTCTC TTCGTCATGG GTGGTGTCTA TTTTGTGTGC 
CAGCGCGTGG TGTGCCAGCG CTATGCGGGG GCCAACGGGC CCTTCCCGCA CGAGTATGTC 

AGCGGGACCC CGCACGTGCC CCTCAATTTC ATAGCCCCGG GCGGTTCCCA GCATGGCCCC 

TTCACAGGCA TCGCATGCGG AAAGTCCATG ATGAGCTCCG TGAGCCTGAT GGGGGGCCGG 

GGCGGGGTGC CCCTCTACGA CCGGAACCAC GTCACAGGGG CCTCGTCCAG CAGCTCGTCC 

AGCACGAAGG CCACGCTGTA CCCGCCGATC CTGAACCCGC CGCCCTCCCC GGCCACGGAC 

CCCTCCCTGT ACAACATGGA CATGTTCTAC TCTTCAAACA TTCCGGCCAC TGCGAGACCG 

TACAGGCCCT ACATCATTCG AGGAATGGCG CCCCCGACGA CGCCCTGCAG CACCGACGTG 

TGTGACAGCG ACTACAGCGC CAGCCGCTGG AAGGCCAGCA AGTACTACCT GGATTTGAAC 

TCGGACTCAG ACCCCTATCC ACCCCCACCC ACGCCCCACA GCCAGTACCT GTCGGCGGAG 

GACAGCTGCC CGCCCTCGCC CGCCACCGAG AGGAGCTACT TCCATCTCTT CCCGCCCCCT 

CCGTCCCCCT GCACGGACTC ATCCTGACCT CGGCCGGGCC ACTCTGGCTT CTCTGTGCCC 

CTGTAAATAG TTTTAAATAT GAACAAAGAA AAAAATATAT TTTATGATTT AAAAAATAAA 

TATAATTGGG ATTTTAAAAA CATGAGAAAT GTGAACTGTG ATGGGGTGGG CAGGGCTGGG 

AGAACTTTGT ACAGTGGAGA AATATTTATA AACTTAATTT TGTAAAACAG AAAAAAAAAA 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA GCGGCCGC 
FIG. 26A

Alignment of Human and Mouse Zmax1 (LRP5)
Amino-Acid Sequences.
Top = Human, Bottom = Mouse

FIG. 26B

Alignment of Human and Mouse Zmax1 (LRP5)
Amino-Acid Sequences.

FIG. 26C

Alignment of Human and Mouse Zmax1 (LRP5)
Amino-Acid Sequences.

FIG. 27A

Alignment of Human LRP5 and LRP6
Top = LRP5, Bottom = LRP6
Underlined = region marked for antibody production
Percent Similarity: 83.542 Percent Identity: 70.901

FIG. 27B

Alignment of Human LRP5 and LRP6

FIG. 27C

Alignment of Human LRP5 and LRP6

Model of Wnt signaling

FIG. 28

Listed are the pcDNA3.1 construct names followed by the DNA
sequence

OST258 (control for OST259-OST262 and OST264, OST265)

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 

GGTTCCAGGTTCCACTGGTGACGGATCC 

OST259 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACTGGTGACGGATCCATGAGCGATAAAATTATTCACCTGA 
CTGACGACAGTTTTGACACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTC 
GATTTCTGGGCAGAGTGGTGCGGTCCGAATTCCGTGGTTCTGTGTTCGCGTTG 
TGGGCGTTTGTGGCGGTGGTCGTGTGGGACTAGTGGTCCGTGCAAAATGATCG 
CCCCGATTCTGGATGAAATCGCTGACGAATATCAGGGCAAACTGACCGTTGCA 
AAACTGAACATCGATCAAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGG 
TATCCCGACTCTGCTGCTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGG 
GTGCACTGTCTAAAGGTCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAA 
GCGGCCGC 

OST260 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACTGGTGACGGATCCATGAGCGATAAAATTATTCACCTGA 
CTGACGACAGTTTTGACACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTC 
GATTTCTGGGCAGAGTGGTGCGGTCCGAATTCCGGGTGGCGGTGGTGTGGTCG 
GTGTGGGGCTTTGTGGTGGCGGCGTGTTACTAGTGGTCCGTGCAAAATGATCG 
CCCCGATTCTGGATGAAATCGCTGACGAATATCAGGGCAAACTGACCGTTGCA 
AAACTGAACATCGATCAAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGG 
TATCCCGACTCTGCTGCTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGG 
GTGCACTGTCTAAAGGTCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAA 
GCGGCCGC 

OST261 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACTGGTGACGGATCCATGAGCGATAAAATTATTCACCTGA 
CTGACGACAGTTTTGACACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTC 
GATTTCTGGGCAGAGTGGTGCGGTCCGAATTCCGAGGTGCGGCAGGTTACGTG 
TATTAGGTGTCGTCGGGGTTTTCTGTTGACTAGTGGTCCGTGCAAAATGATCG 
CCCCGATTCTGGATGAAATCGCTGACGAATATCAGGGCAAACTGACCGTTGCA 

FIG. 39A 
AAACTGAACATCGATCAAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGG
TATCCCGACTCTGCTGCTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGG 
GTGCACTGTCTAAAGGTCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAA 
GCGGCCGC 

OST262 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACTGGTGACGGATCCATGAGCGATAAAATTATTCACCTGA 
CTGACGACAGTTTTGACACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTC 
GATTTCTGGGCAGAGTGGTGCGGTCCGAATTCCGGTGGTGGGGGGATGATTTG 
GGAGGCTTGGAGTTGTTATGCGTGTGGGACTAGTGGTCCGTGCAAAATGATCG 
CCCCGATTCTGGATGAAATCGCTGACGAATATCAGGGCAAACTGACCGTTGCA 
AAACTGAACATCGATCAAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGG 
TATCCCGACTCTGCTGCTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGG 
GTGCACTGTCTAAAGGTCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAA
GCGGCCGC 

OST263 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACTGGTGACGGATCCATGAGCGATAAAATTATTCACCTGA 
CTGACGACAGTTTTGACACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTC 
GATTTCTGGGCAGAGTGGTGCGGTCCGAATTCCTTGTGGATTGGGCCGGGTGA 
TCAGGGTCTGTTTCGGCGTTTTGTTTTTACTAGTGGTCCGTGCAAAATGATCG 
CCCCGATTCTGGATGAAATCGCTGACGAATATCAGGGCAAACTGACCGTTGCA 
AAACTGAACATCGATCAAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGG 
TATCCCGACTCTGCTGCTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGG 
GTGCACTGTCTAAAGGTCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAA
GCGGCCGC 

OST264 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACTGGTGACGGATCCGTGTCTTCTGATCAAAATCATTTCC 
GAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGATCATAGCACC 
TTGGATGGGTATTCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACCAA 
AGGACAAGAAGGTTCTGTTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGT 
GTTGTGCTAGACACTTCTGGTCCAAGATCTGTAAACCTGTCCTGAAAGAAGGT 
CAAGTGTGTACCAAGCATAGGAGAAAAGGCTCTCATGGACTAGAAATATTCCA 
GCGTTGTTACTGTGGAGAAGGTCTGTCTTGCCGGATACAGAAAGATCACCATC 
AAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGAGACACTAAGCGGCCGC 

FIG. 39B 
OST265

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACTGGTGACGGATCCTGCGCTAGTCCCACCCGCGGAGGGG 
ACGCGGGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACGCTGCATG 
CGTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTC 
TTCTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCT 
TTGGTAATGATCATAGCACCTTGGATGGGTATTCCAGAAGAACCACCTTGTCT 
TCAAAAATGTATCACACCAAAGGACAAGAAGGTTCTGTTTGTCTCCGGTCATC
AGACTGTGCCTCAGGATTGTGTTGTGCTAGACACTTCTGGTCCAAGATCTGTA 
AACCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCATAGGAGAAAAGGCTCT 
CATGGACTAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTCTTGCTA 
AGCGGCCGC 

OST266 

AAGCTTGCCACCATGGGCGATAAAATTATTCACCTGACTGACGACAGTTTTGA 
CACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTCGATTTCTGGGCAGAGT 
GGTGCGGTCCGAATTCCTATGCGTGGTTGTTTTCTTGTAGTAGGTGTAGGTGG 
TGGTTGCCTTGGACTAGTGGTCCGTGCAAAATGATCGCCCCGATTCTGGATGA 
AATCGCTGACGAATATCAGGGCAAACTGACCGTTGCAAAACTGAACATCGATC 
AAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGGTATCCCGACTCTGCTG 
CTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGGGTGCACTGTCTAAAGG 
TCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAAGCGGCCGC 

OST267 

AAGCTTGCCACCATGGGCGATAAAATTATTCACCTGACTGACGACAGTTTTGA 
CACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTCGATTTCTGGGCAGAGT 
GGTGCGGTCCGAATTCCATTTGTGAGGTTGTGAGGTTGTGGAGTCGGTATCCT 
TGGTCTTGGGTGACTAGTGGTCCGTGCAAAATGATCGCCCCGATTCTGGATGA 
AATCGCTGACGAATATCAGGGCAAACTGACCGTTGCAAAACTGAACATCGATC 
AAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGGTATCCCGACTCTGCTG 
CTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGGGTGCACTGTCTAAAGG 
TCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAAGCGGCCGC 

FIG. 39C 
OST268

AAGCTTGCCACCATGGGCGATAAAATTATTCACCTGACTGACGACAGTTTTGA 
CACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTCGATTTCTGGGCAGAGT 
GGTGCGGTCCGAATTCCGGTTGTACTAGTGCGGTGTGTGGTGCTTGGGCTGAG 
GCGGGTAGGTTTTATTGTACTAGTGGTCCGTGCAAAATGATCGCCCCGATTCT 
GGATGAAATCGCTGACGAATATCAGGGCAAACTGACCGTTGCAAAACTGAACA 
TCGATCAAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGGTATCCCGACT 
CTGCTGCTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGGGTGCACTGTC 
TAAAGGTCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAAGCGGCCGC 

OST269 (irrelevant control peptide for OST266-OST268) 

AAGCTTGCCACCATGGGCGATAAAATTATTCACCTGACTGACGACAGTTTTGA 

CACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTCGATTTCTGGGCAGAGT 

GGTGCGGTCCGAATTCCTTGTGGATTGGGCCGGGTGATCAGGGTCTGTTTCGG 

CGTTTTGTTTTTACTAGTGGTCCGTGCAAAATGATCGCCCCGATTCTGGATGA 

AATCGCTGACGAATATCAGGGCAAACTGACCGTTGCAAAACTGAACATCGATC 

AAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGGTATCCCGACTCTGCTG 

CTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGGGTGCACTGTCTAAAGG 

TCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAAGCGGCCGC 



FIG. 39D 
Listed below are the amino acid sequences corresponding to the
pcDNA3.1 constructs in Appendix 1A

OST258

METDTLLLWVLLLWVPGSTGDGS

OST259

METDTLLLWVLLLWVPGSTGDGSMSDKIIHLTDDSFDTDVLKADGAILVDFWA
EWCGPNSVVLCSRCGRLWRWSCGTSGPCKMIAPILDEIADEYQGKLTVAKLNI
DQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLA

OST260

METDTLLLWVLLLWVPGSTGDGSMSDKIIHLTDDSFDTDVLKADGAILVDFWA
EWCGPNSGWRWCGRCGALWWRRVTSGPCKMIAPILDEIADEYQGKLTVAKLNI
DQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLA

OST261

METDTLLLWVLLLWVPGSTGDGSMSDKIIHLTDDSFDTDVLKADGAILVDFWA
EWCGPNSEVRQVTCIRCRRGFLLTSGPCKMIAPILDEIADEYQGKLTVAKLNI
DQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLA

OST262

METDTLLLWVLLLWVPGSTGDGSMSDKIIHLTDDSFDTDVLKADGAILVDFWA
EWCGPNSGGGGMIWEAWSCYACGTSGPCKMIAPILDEIADEYQGKLTVAKLNI
DQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLA

OST263

METDTLLLWVLLLWVPGSTGDGSMSDKIIHLTDDSFDTDVLKADGAILVDFWA
EWCGPNSLWIGPGDQGLFRRFVFTSGPCKMIAPILDEIADEYQGKLTVAKLNI
DQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLA

OST264

METDTLLLWVLLLWVPGSTGDGSVSSDQNHFRGEIEETITESFGNDHSTLDGY
SRRTTLSSKMYHTKGQEGSVCLRSSDCASGLCCARHFWSKICKPVLKEGQVCT
KHRRKGSHGLEIFQRCYCGEGLSCRIQKDHHQASNSSRLHTCQRH



FIG. 44A 
OST265

METDTLLLWVLLLWVPGSTGDGSCASPTRGGDAGVQICLACRKRRKRCMRHAM
CCPGNYCKNGICVSSDQNHFRGEIEETITESFGNDHSTLDGYSRRTTLSSKMY
HTKGQEGSVCLRSSDCASGLCCARHFWSKICKPVLKEGQVCTKHRRKGSHGLE
IFQRCYCGEGLSC

OST266

MGDKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPNSYAWLFSCSRCRWWLPW
TSGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLLLFKN
GEVAATKVGALSKGQLKEFLDANLA

OST267

MGDKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPNSICEVVRLWSRYPWSWV
TSGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLLLFKN
GEVAATKVGALSKGQLKEFLDANLA

OST268

MGDKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPNSGCTSAVCGAWAEAGRF
YCTSGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLLLF
KNGEVAATKVGALSKGQLKEFLDANLA

OST269

MGDKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPNSLWIGPGDQGLFRRFVF
TSGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLLLFKN
GEVAATKVGALSKGQLKEFLDANLA



FIG. 44B 
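
The correspondence between the construct DNA sequences of FIG. 39 and the amino acid sequences of FIG. 44 can be checked by conceptual translation. The following is a minimal sketch only, assuming the standard genetic code and assuming translation starts at the first ATG of the insert and stops at the first stop codon or the end of the sequence; the function name `translate_orf` is hypothetical and is not part of the disclosure. It is shown for the OST258 control insert.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to these constructs).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_orf(dna: str) -> str:
    """Translate from the first ATG to the first stop codon or sequence end."""
    dna = dna.upper()
    start = dna.find("ATG")
    if start < 0:
        return ""
    peptide = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the open reading frame
            break
        peptide.append(aa)
    return "".join(peptide)


# OST258 insert as listed in FIG. 39A
ost258 = (
    "AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG"
    "GGTTCCAGGTTCCACTGGTGACGGATCC"
)
print(translate_orf(ost258))  # prints METDTLLLWVLLLWVPGSTGDGS
```

The printed peptide agrees with the OST258 entry of FIG. 44A; the same check can be applied to the other OST inserts.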
FIG. 50



plasma membrane 




LY propeller blade (YWTD "domain"/repeat
[~43-45 aa ea./~265/prop.])

EGF epidermal growth factor-like domain [~42 aa] 

LA LDL receptor class A module [~38 aa]

TM transmembrane helix [22 aa] 

ICD intracellular domain [1 07 aa] 
SEQUENCE LISTING

<110> Allen, Kristina M. 
Yaworsky, Paul 
Morales, Arturo J. 
Graham, James R. 
Anisowicz, Anthony
Liu, Wei 

<120> HBM Variants that Modulate Bone Mass and Lipid Levels 

<130> 032796-135 

<150> US 60/290,071 
<151> 2001-05-11 

<150> US 60/291,311 
<151> 2001-05-17 

<150> US 60/353,058 
<151> 2002-02-01 

<150> US 60/361,293 
<151> 2002-03-04 

<160> 765 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 5120 
<212> DNA 

<213> Homo sapiens 
<400> 1 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc  60
gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg  109
           Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu
           1               5                   10

ctg ctg ctg ctg ctg ctg ctg gcg ctg tgc ggc tgc ccg gcc ccc gcc  157
Leu Leu Leu Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala
    15                  20                  25

gcg gcc tcg ccg ctc ctg cta ttt gcc aac cgc cgg gac gta cgg ctg  205
Ala Ala Ser Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu
30                  35                  40                  45

gtg gac gcc ggc gga gtc aag ctg gag tcc acc atc gtg gtc agc ggc  253
Val Asp Ala Gly Gly Val Lys Leu Glu Ser Thr Ile Val Val Ser Gly
                50                  55                  60

ctg gag gat gcg gcc gca gtg gac ttc cag ttt tcc aag gga gcc gtg  301
Leu Glu Asp Ala Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val
            65                  70                  75

tac tgg aca gac gtg agc gag gag gcc atc aag cag acc tac ctg aac  349
Tyr Trp Thr Asp Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn
80




85 




cag 


acg 


ggg 


gec 


gec gtg cag aac 


gtg 


Gin 


Thr 


Gly 


Ala 


Ala Val Gin Asn 


Val 




95 






100 




ccc 


gac 


ggc 


etc 


gec tgc gac tgg 


gtg 


Pro 


Asp 


Gly 


Leu 


Ala Cys Asp Trp 


Val 


110 








115 




gac 


tea 


gag 


acc 


aac cgc ate gag 


gtg 


Asp 


Ser 


Glu 


Thr 


Asn Arg He Glu 


Val 










130 




egg 


aag 


gtg 


etc 


ttc tgg cag gac 


ctt 


Arg 


Lys 


Val 


Leu 


Phe Trp Gin Asp 


Leu 








145 




150 


ttg 


gac 


ccc 


get 


cac ggg tac atg 


tac 


Leu 


Asp 


Pro 


Ala 


His Gly Tyr Met 


Tyr 






160 




165 




ccc 


egg 


att 


gag 


egg gca ggg atg 


gat 


Pro 


Arg 


He 


Glu 


Arg Ala Gly Met 


Asp 




175 






180 




gtg gac 


teg 


gac 


att tac tgg ccc 


aat 


Val 


Asp 


Ser 


Asp 


He Tyr Trp Pro 


Asn 


190 








195 




gag 


cag 


aag 


etc 


tac tgg get gac 


gee 


Glu 


Gin 


Lys 


Leu 


Tyr Trp Ala Asp 


Ala 










210 




gec 


aac 


ctg 


gac 


ggc teg ttc egg 


cag 


Ala 


Asn 


Leu 


Asp 


Gly Ser Phe Arg 


Gin 








225 




230 


acg 


cac 


ccc 


ttc 


gee ctg acg etc 


tec 


Thr 


His 


Pro 


Phe 


Ala Leu Thr Leu 


Ser 






240 




245 




gac 


tgg 


cag 


acc 


cgc tec ate cat 


gee 


Asp 


Trp 


Gin 


Thr 


Arg Ser He His 


Ala 




255 






260 




aag 


agg 


aag 


gag 


ate ctg agt gec 


etc 


Lys 


Arg 


Lys 


Glu 


He Leu Ser Ala 


Leu 


270 








275 




gtg 


ctg 


age 


cag 


gag egg cag cct 


ttc 


Val 


Leu 


Ser 


Gin 


Glu Arg Gin Pro 


Phe 










290 




gac 


aat 


ggc 


ggc 


tgc tec cac ctg 


tgc 


Asp 


Asn 


Gly 


Gly 


Cys Ser His Leu 


Cys 








305 




310 


ttc 


tac 


aca 


tgc 


gee tgc ccc acg 


ggt 


Phe 


Tyr 


Thr 


Cys 


Ala Cys Pro Thr 


Gly 






320 




325 




agg 


acg 


tgt 


aag 


gca gga gec gag 


gag 


Arg 


Thr 


Cys 


Lys 


Ala Gly Ala Glu 


Glu 




335 






340 




acg 


gac 


eta 


egg 


agg ate teg ctg 


gac 


Thr Asp 


Leu 


Arg 


Arg He Ser Leu 


Asp 


350 








355 




gtg 


ctg 


cag 


gtg 


gac gac ate egg 


cac 


Val


Leu 


Gin 


Val 


Asp Asp He Arg 


His 



90 



gtc 


ate 


tec 


ggc ctg gtc 


tct 


397 


Val 


He 


Ser 


Gly Leu Val 


Ser 








105 








ggc 


aag 


aag 


ctg tac tgg 


acg 


445 


Gly 


Lys 


Lys 


Leu Tyr Trp 


Thr 






120 






125 




gee 


aac 


etc 


aat ggc aca 


tec 


493 


Ala 


Asn 


Leu 


Asn Gly Thr 


Ser 




135 






140 






gac 


cag 


ccg 


agg gee ate 


gec 


541 


Asp 


Gin 


Pro 


Arg Ala He 


Ala 










155 






tgg 


aca 


gac 


tgg ggt gag 


acg 


589 


Trp 


Thr 


Asp 


Trp Gly Glu 


Thr 










170 






ggc 


age 


acc 


egg aag ate 


att 


637 


Gly 


Ser 


Thr 


Arg Lys He 


He 








185 








gga 


ctg 


acc 


ate gac ctg 


gag 


685 


Gly 


Leu 


Thr 


He Asp Leu 


Glu 






200 






205 




aag 


etc 


age 


ttc ate cac 


cgt 


733 


Lys 


Leu 


Ser 


Phe He His 


Arg 




215 






220 






aag 


gtg 


gtg 


gag ggc age 


ctg 


781 


Lys 


Val 


Val 


Glu Gly Ser 


Leu 










235 






ggg 


gac 


act 


ctg tac tgg 


aca 


829 


Gly 


Asp 


Thr 


Leu Tyr Trp 


Thr 










250 






tgc 


aac 


aag 


cgc act ggg 


ggg 


877 


Cys 


Asn 


Lys 


Arg Thr Gly 


Gly 








265 








tac 


tea 


ccc 


atg gac ate 


cag 


925 


Tyr 


Ser 


Pro 


Met Asp He 


Gin 






280 






285 




ttc 


cac 


act 


cgc tgt gag 


gag 


973 


Phe 


His 


Thr 


Arg Cys Glu 


Glu 




295 






300 






ctg 


ctg 


tec 


cca age gag 


cct 


1021 


Leu 


Leu 


Ser 


Pro Ser Glu 


Pro 










315 






gtg 


cag 


ctg 


cag gac aac 


ggc 


1069 


Val 


Gin 


Leu 


Gin Asp Asn 


Gly 










330 






gtg 


ctg 


ctg 


ctg gee egg 


egg 


1117 


Val 


Leu 


Leu 


Leu Ala Arg 


Arg 








345 








acg 


ccg 


gac 


ttc acc gac 


ate 


1165 


Thr 


Pro 


Asp 


Phe Thr Asp 


He 






360 






365 




gec 


att 


gee 


ate gac tac 


gac 


1213 


Ala 


He 


Ala 


He Asp Tyr 


Asp 
370 375 380

ccg cta gag ggc tat gtc tac tgg aca gat gac gag gtg cgg gcc atc 1261

Pro Leu Glu Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala Ile

385 390 395

cgc agg gcg tac ctg gac ggg tct ggg gcg cag acg ctg gtc aac acc 1309

Arg Arg Ala Tyr Leu Asp Gly Ser Gly Ala Gln Thr Leu Val Asn Thr

400 405 410

gag atc aac gac ccc gat ggc atc gcg gtc gac tgg gtg gcc cga aac 1357
Glu Ile Asn Asp Pro Asp Gly Ile Ala Val Asp Trp Val Ala Arg Asn

415 420 425

ctc tac tgg acc gac acg ggc acg gac cgc atc gag gtg acg cgc ctc 1405
Leu Tyr Trp Thr Asp Thr Gly Thr Asp Arg Ile Glu Val Thr Arg Leu
430 435 440 445

aac ggc acc tcc cgc aag atc ctg gtg tcg gag gac ctg gac gag ccc 1453
Asn Gly Thr Ser Arg Lys Ile Leu Val Ser Glu Asp Leu Asp Glu Pro

450 455 460

cga gcc atc gca ctg cac ccc gtg atg ggc ctc atg tac tgg aca gac 1501
Arg Ala Ile Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp

465 470 475

tgg gga gag aac cct aaa atc gag tgt gcc aac ttg gat ggg cag gag 1549
Trp Gly Glu Asn Pro Lys Ile Glu Cys Ala Asn Leu Asp Gly Gln Glu

480 485 490

cgg cgt gtg ctg gtc aat gcc tcc ctc ggg tgg ccc aac ggc ctg gcc 1597
Arg Arg Val Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala

495 500 505

ctg gac ctg cag gag ggg aag ctc tac tgg gga gac gcc aag aca gac 1645

Leu Asp Leu Gln Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp
510 515 520 525

aag atc gag gtg atc aat gtt gat ggg acg aag agg cgg acc ctc ctg 1693
Lys Ile Glu Val Ile Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu

530 535 540

gag gac aag ctc ccg cac att ttc ggg ttc acg ctg ctg ggg gac ttc 1741
Glu Asp Lys Leu Pro His Ile Phe Gly Phe Thr Leu Leu Gly Asp Phe

545 550 555

atc tac tgg act gac tgg cag cgc cgc agc atc gag cgg gtg cac aag 1789
Ile Tyr Trp Thr Asp Trp Gln Arg Arg Ser Ile Glu Arg Val His Lys

560 565 570

gtc aag gcc agc cgg gac gtc atc att gac cag ctg ccc gac ctg atg 1837

Val Lys Ala Ser Arg Asp Val Ile Ile Asp Gln Leu Pro Asp Leu Met

575 580 585

ggg ctc aaa gct gtg aat gtg gcc aag gtc gtc gga acc aac ccg tgt 1885

Gly Leu Lys Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys
590 595 600 605

gcg gac agg aac ggg ggg tgc agc cac ctg tgc ttc ttc aca ccc cac 1933

Ala Asp Arg Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His

610 615 620

gca acc cgg tgt ggc tgc ccc atc ggc ctg gag ctg ctg agt gac atg 1981
Ala Thr Arg Cys Gly Cys Pro Ile Gly Leu Glu Leu Leu Ser Asp Met

625 630 635

aag acc tgc atc gtg cct gag gcc ttc ttg gtc ttc acc agc aga gcc 2029
Lys Thr Cys Ile Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala

640 645 650

gcc atc cac agg atc tcc ctc gag acc aat aac aac gac gtg gcc atc 2077

Ala Ile His Arg Ile Ser Leu Glu Thr Asn Asn Asn Asp Val Ala Ile
655



ccg 


etc 


acg 


ggc gtc aag 


Pro 


Leu 


Thr 


Gly Val Lys 


670 






675 


aac 


aac 


cac 


ate tac tgg 


Asn 


Asn 


His 


Ile Tyr Trp








690 


gcc 


ttc 


atg 


aac ggg age 


Ala 


Phe 


Met 


Asn Gly Ser 








705 


gac 


tac 


ccc 


gag ggc atg 


Asp 


Tyr 


Pro 


Glu Gly Met 






720 




tgg 


gcc 


gac 


act ggg acc 


Trp 


Ala 


Asp 


Thr Gly Thr 




735 






cag 


ttc 


egg 


caa gtc etc 


Gin 


Phe 


Arg 


Gin Val Leu 


750 






755 


ctg 


gcc 


ctg 


gat ccc acc 


Leu 


Ala 


Leu 


Asp Pro Thr 








770 


ggc 


aag 


ccg 


agg ate gtg 


Gly 


Lys 


Pro 


Arg He Val 








785 


acg 


ctg 


gtg 


gac aag gtg 


Thr 


Leu 


Val 


Asp Lys Val 






800 




get 


gac 


cag 


cgc etc tac 


Ala 


Asp 


Gin 


Arg Leu Tyr 




815 






teg 


tec 


aac 


atg ctg ggt 


Ser 


Ser 


Asn 


Met Leu Gly 


830 






835 


ccg 


cac 


ccg 


ttc ggt ctg 


Pro 


His 


Pro 


Phe Gly Leu 








850 


gac 


tgg 


aat 


ctg cac age 


Asp 


Trp 


Asn 


Leu His Ser 








865 


aac 


cgc 


acc


etc ate cag 


Asn 


Arg 


Thr 


Leu He Gin 






880 




gtg 


ttc 


cac 


tec tec cgc 


Val 


Phe 


His 


Ser Ser Arg 




895 






aac 


ggg


cag 


tgt ggg cag 


Asn 


Gly 


Gin 


Cys Gly Gin 


910 






915 


tgc 


ggc 


tgc 


gcc tea cac 


Cys 


Gly 


Cys 


Ala Ser His 








930 


age 


ccg 


ccc 


acc acc ttc 


Ser 


Pro 


Pro 


Thr Thr Phe 



660 










665 


gag


gcc 


tea 


gcc 


ctg 


gac 


Glu 


Ala 


Ser 


Ala 


Leu 


Asp 










680 




aca 


gac 


gtc 


age 


ctg 


aag 


Thr 


Asp 


Val 


Ser 


Leu 


Lys 








695 






teg 


gtg 


gag 


cac 


gtg 


gtg 


Ser 


Val 


Glu 


His 


Val 


Val 






710 








gcc 


gtt 


gac 


tgg 


atg 


ggc 


Ala 


Val 


Asp 


Trp 


Met 


Gly 




725 










aac 


aga 


ate 


gaa 


gtg 


gcg 


Asn 


Arg 


He 


Glu 


Val 


Ala 


740 










745 


gtg 


tgg 


agg 


gac 


ttg 


gac 


Val 


Trp 


Arg 


Asp 


Leu 


Asp 










760 




aag 


ggc 


tac 


ate 


tac 


tgg 


Lys 


Gly 


Tyr 


He 


Tyr 


Trp 








775 






egg 


gcc 


ttc 


atg 


gac 


ggg 


Arg 


Ala 


Phe 


Met 


Asp 


Gly 






790 








ggc 


egg 


gcc 


aac 


gac 


etc 


Gly 


Arg 


Ala 


Asn 


Asp 


Leu 




805 










tgg 


acc 


gac 


ctg 


gac 


acc 


Trp 


Thr 


Asp 


Leu 


Asp 


Thr 


820 










825 


cag 


gag 


egg 


gtc 


gtg 


att 


Gin 


Glu 


Arg 


Val 


Val 


He 










840 




acg 


cag 


tac 


age 


gat 


tat 


Thr 


Gin 


Tyr 


Ser 


Asp 


Tyr 








855 






att 


gag 


egg 


gcc 


gac 


aag 


He 


Glu 


Arg 


Ala 


Asp 


Lys 






870 








ggc 


cac 


ctg 


gac 


ttc 


gtg 


Gly 


His 


Leu 


Asp 


Phe 


Val 




885 










cag 


gat 


ggc 


etc 


aat 


gac 


Gin 


Asp 


Gly 


Leu 


Asn 


Asp 


900 










905 


ctg 


tgc 


ctt 


gcc 


ate 


ccc 


Leu 


Cys 


Leu 


Ala 


He 


Pro 










920 




tac 


acc 


ctg 


gac 


ccc 


age 


Tyr 


Thr 


Leu 


Asp 


Pro 


Ser 








935 






ttg 


ctg 


ttc 


age 


cag 


aaa 


Leu 


Leu 


Phe 


Ser 


Gin 


Lys 



ttt gat gtg tcc 2125
Phe Asp Val Ser
685

acc atc agc cgc 2173
Thr Ile Ser Arg
700

gag ttt ggc ctt 2221
Glu Phe Gly Leu
715

aag aac ctc tac 2269

Lys Asn Leu Tyr

730

cgg ctg gac ggg 2317
Arg Leu Asp Gly

aac ccg agg tcg 2365
Asn Pro Arg Ser
765

acc gag tgg ggc 2413
Thr Glu Trp Gly
780

acc aac tgc atg 2461
Thr Asn Cys Met
795

acc att gac tac 2509

Thr Ile Asp Tyr

810

aac atg atc gag 2557
Asn Met Ile Glu

gcc gac gat ctc 2605

Ala Asp Asp Leu
845

atc tac tgg aca 2653
Ile Tyr Trp Thr
860

act agc ggc cgg 2701

Thr Ser Gly Arg
875

atg gac atc ctg 2749

Met Asp Ile Leu

890

tgt atg cac aac 2797
Cys Met His Asn

ggc ggc cac cgc 2845

Gly Gly His Arg
925

agc cgc aac tgc 2893

Ser Arg Asn Cys
940

tct gcc atc agt 2941
Ser Ala Ile Ser

945 950 955

cgg atg atc ccg gac gac cag cac agc ccg gat ctc atc ctg ccc ctg 2989
Arg Met Ile Pro Asp Asp Gln His Ser Pro Asp Leu Ile Leu Pro Leu

960 965 970

cat gga ctg agg aac gtc aaa gcc atc gac tat gac cca ctg gac aag 3037

His Gly Leu Arg Asn Val Lys Ala Ile Asp Tyr Asp Pro Leu Asp Lys

975 980 985

ttc atc tac tgg gtg gat ggg cgc cag aac atc aag cga gcc aag gac 3085
Phe Ile Tyr Trp Val Asp Gly Arg Gln Asn Ile Lys Arg Ala Lys Asp
990 995 1000 1005

gac ggg acc cag ccc ttt gtt ttg acc tct ctg agc caa ggc caa aac 3133
Asp Gly Thr Gln Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn

1010 1015 1020

cca gac agg cag ccc cac gac ctc agc atc gac atc tac agc cgg aca 3181
Pro Asp Arg Gln Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr

1025 1030 1035

ctg ttc tgg acg tgc gag gcc acc aat acc atc aac gtc cac agg ctg 3229
Leu Phe Trp Thr Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu

1040 1045 1050

agc ggg gaa gcc atg ggg gtg gtg ctg cgt ggg gac cgc gac aag ccc 3277
Ser Gly Glu Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro

1055 1060 1065

agg gcc atc gtc gtc aac gcg gag cga ggg tac ctg tac ttc acc aac 3325
Arg Ala Ile Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn
1070 1075 1080 1085

atg cag gac cgg gca gcc aag atc gaa cgc gca gcc ctg gac ggc acc 3373
Met Gln Asp Arg Ala Ala Lys Ile Glu Arg Ala Ala Leu Asp Gly Thr

1090 1095 1100

gag cgc gag gtc ctc ttc acc acc ggc ctc atc cgc cct gtg gcc ctg 3421
Glu Arg Glu Val Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu

1105 1110 1115

gtg gtg gac aac aca ctg ggc aag ctg ttc tgg gtg gac gcg gac ctg 3469
Val Val Asp Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu

1120 1125 1130

aag cgc att gag agc tgt gac ctg tca ggg gcc aac cgc ctg acc ctg 3517

Lys Arg Ile Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu

1135 1140 1145

gag gac gcc aac atc gtg cag cct ctg ggc ctg acc atc ctt ggc aag 3565
Glu Asp Ala Asn Ile Val Gln Pro Leu Gly Leu Thr Ile Leu Gly Lys
1150 1155 1160 1165

cat ctc tac tgg atc gac cgc cag cag cag atg atc gag cgt gtg gag 3613

His Leu Tyr Trp Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu

1170 1175 1180

aag acc acc ggg gac aag cgg act cgc atc cag ggc cgt gtc gcc cac 3661
Lys Thr Thr Gly Asp Lys Arg Thr Arg Ile Gln Gly Arg Val Ala His

1185 1190 1195

ctc act ggc atc cat gca gtg gag gaa gtc agc ctg gag gag ttc tca 3709

Leu Thr Gly Ile His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser

1200 1205 1210

gcc cac cca tgt gcc cgt gac aat ggt ggc tgc tcc cac atc tgt att 3757
Ala His Pro Cys Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile

1215 1220 1225

gcc aag ggt gat ggg aca cca cgg tgc tca tgc cca gtc cac ctc gtg 3805
Ala Lys Gly Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val
1230 1235 1240 1245



ctc ctg cag aac ctg ctg acc tgt gga gag ccg ccc acc tgc tcc ccg 3853
Leu Leu Gln Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro
1250 1255 1260




gac cag ttt gca tgt gcc aca ggg gag atc gac tgt atc ccc ggg gcc 3901
Asp Gln Phe Ala Cys Ala Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala
1265 1270 1275






tgg cgc tgt gac ggc ttt ccc gag tgc gat gac cag agc gac gag gag 3949
Trp Arg Cys Asp Gly Phe Pro Glu Cys Asp Asp Gln Ser Asp Glu Glu
1280 1285 1290








ggc tgc ccc gtg tgc tcc gcc gcc cag ttc ccc tgc gcg cgg ggt cag 3997
Gly Cys Pro Val Cys Ser Ala Ala Gln Phe Pro Cys Ala Arg Gly Gln
1295 1300 1305










tgt gtg gac ctg cgc ctg cgc tgc gac ggc gag gca gac tgt cag gac 4045
Cys Val Asp Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp
1310 1315 1320 1325




cgc tca gac gag gtg gac tgt gac gcc atc tgc ctg ccc aac cag ttc 4093
Arg Ser Asp Glu Val Asp Cys Asp Ala Ile Cys Leu Pro Asn Gln Phe
1330 1335 1340




cgg tgt gcg agc ggc cag tgt gtc ctc atc aaa cag cag tgc gac tcc 4141
Arg Cys Ala Ser Gly Gln Cys Val Leu Ile Lys Gln Gln Cys Asp Ser
1345 1350 1355






ttc ccc gac tgt atc gac ggc tcc gac gag ctc atg tgt gaa atc acc 4189
Phe Pro Asp Cys Ile Asp Gly Ser Asp Glu Leu Met Cys Glu Ile Thr
1360 1365 1370








aag ccg ccc tca gac gac agc ccg gcc cac agc agt gcc atc ggg ccc 4237
Lys Pro Pro Ser Asp Asp Ser Pro Ala His Ser Ser Ala Ile Gly Pro
1375 1380 1385










gtc att ggc atc atc ctc tct ctc ttc gtc atg ggt ggt gtc tat ttt 4285
Val Ile Gly Ile Ile Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe
1390 1395 1400 1405




gtg tgc cag cgc gtg gtg tgc cag cgc tat gcg ggg gcc aac ggg ccc 4333
Val Cys Gln Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro
1410 1415 1420




ttc ccg cac gag tat gtc agc ggg acc ccg cac gtg ccc ctc aat ttc 4381
Phe Pro His Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe
1425 1430 1435






ata gcc ccg ggc ggt tcc cag cat ggc ccc ttc aca ggc atc gca tgc 4429
Ile Ala Pro Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys
1440 1445 1450








gga aag tcc atg atg agc tcc gtg agc ctg atg ggg ggc cgg ggc ggg 4477
Gly Lys Ser Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly
1455 1460 1465










gtg ccc ctc tac gac cgg aac cac gtc aca ggg gcc tcg tcc agc agc 4525
Val Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser
1470 1475 1480 1485




tcg tcc agc acg aag gcc acg ctg tac ccg ccg atc ctg aac ccg ccg 4573
Ser Ser Ser Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro
1490 1495 1500




ccc tcc ccg gcc acg gac ccc tcc ctg tac aac atg gac atg ttc tac 4621
Pro Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr
1505 1510 1515






tct tca aac att ccg gcc act gcg aga ccg tac agg ccc tac atc att 4669
Ser Ser Asn Ile Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr Ile Ile
1520 1525 1530

cga gga atg gcg ccc ccg acg acg ccc tgc agc acc gac gtg tgt gac 4717
Arg Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp

1535 1540 1545

agc gac tac agc gcc agc cgc tgg aag gcc agc aag tac tac ctg gat 4765
Ser Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp
1550 1555 1560 1565

ttg aac tcg gac tca gac ccc tat cca ccc cca ccc acg ccc cac agc 4813
Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser

1570 1575 1580

cag tac ctg tcg gcg gag gac agc tgc ccg ccc tcg ccc gcc acc gag 4861
Gln Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu

1585 1590 1595

agg agc tac ttc cat ctc ttc ccg ccc cct ccg tcc ccc tgc acg gac 4909
Arg Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp

1600 1605 1610

tca tcc tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt 4965
Ser Ser
1615

aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt 5025
taaaaacatg agaaatgtga actgtgatgg ggtgggcagg gctgggagaa ctttgtacag 5085
tggagaaata tttataaact taattttgta aaaca 5120



<210> 2 

<211> 5120 

<212> DNA 

<213> Homo sapiens 



<400> 2 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc
gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 
Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu 
1 5 10 



ctg ctg ctg ctg ctg ctg ctg gcg ctg tgc ggc tgc ccg gcc ccc gcc 157
Leu Leu Leu Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala
15 20 25












gcg gcc tcg ccg ctc ctg cta ttt gcc aac cgc cgg gac gta cgg ctg 205
Ala Ala Ser Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu
30 35 40 45




gtg gac gcc ggc gga gtc aag ctg gag tcc acc atc gtg gtc agc ggc 253
Val Asp Ala Gly Gly Val Lys Leu Glu Ser Thr Ile Val Val Ser Gly
50 55 60






ctg gag gat gcg gcc gca gtg gac ttc cag ttt tcc aag gga gcc gtg 301
Leu Glu Asp Ala Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val
65 70 75








tac tgg aca gac gtg agc gag gag gcc atc aag cag acc tac ctg aac 349
Tyr Trp Thr Asp Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn
80 85 90










cag acg ggg gcc gcc gtg cag aac gtg gtc atc tcc ggc ctg gtc tct 397
Gln Thr Gly Ala Ala Val Gln Asn Val Val Ile Ser Gly Leu Val Ser
95 100 105












ccc gac ggc ctc gcc tgc gac tgg gtg ggc aag aag ctg tac tgg acg 445
Pro Asp Gly Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr
110 115 120 125
gac tca gag acc aac cgc atc gag gtg gcc aac ctc aat ggc aca tcc 493
Asp Ser Glu Thr Asn Arg Ile Glu Val Ala Asn Leu Asn Gly Thr Ser
130 135 140






cgg aag gtg ctc ttc tgg cag gac ctt gac cag ccg agg gcc atc gcc 541
Arg Lys Val Leu Phe Trp Gln Asp Leu Asp Gln Pro Arg Ala Ile Ala
145 150 155








ttg gac ccc gct cac ggg tac atg tac tgg aca gac tgg gtt gag acg 589
Leu Asp Pro Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Val Glu Thr
160 165 170










ccc cgg att gag cgg gca ggg atg gat ggc agc acc cgg aag atc att 637
Pro Arg Ile Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys Ile Ile
175 180 185












gtg gac tcg gac att tac tgg ccc aat gga ctg acc atc gac ctg gag 685
Val Asp Ser Asp Ile Tyr Trp Pro Asn Gly Leu Thr Ile Asp Leu Glu
190 195 200 205




gag cag aag ctc tac tgg gct gac gcc aag ctc agc ttc atc cac cgt 733
Glu Gln Lys Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe Ile His Arg
210 215 220




gcc aac ctg gac ggc tcg ttc cgg cag aag gtg gtg gag ggc agc ctg 781
Ala Asn Leu Asp Gly Ser Phe Arg Gln Lys Val Val Glu Gly Ser Leu
225 230 235




acg cac ccc ttc gcc ctg acg ctc tcc ggg gac act ctg tac tgg aca 829
Thr His Pro Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr
240 245 250










gac tgg cag acc cgc tcc atc cat gcc tgc aac aag cgc act ggg ggg 877
Asp Trp Gln Thr Arg Ser Ile His Ala Cys Asn Lys Arg Thr Gly Gly
255 260 265












aag agg aag gag atc ctg agt gcc ctc tac tca ccc atg gac atc cag 925
Lys Arg Lys Glu Ile Leu Ser Ala Leu Tyr Ser Pro Met Asp Ile Gln
270 275 280 285




gtg ctg agc cag gag cgg cag cct ttc ttc cac act cgc tgt gag gag 973
Val Leu Ser Gln Glu Arg Gln Pro Phe Phe His Thr Arg Cys Glu Glu
290 295 300




gac aat ggc ggc tgc tcc cac ctg tgc ctg ctg tcc cca agc gag cct 1021
Asp Asn Gly Gly Cys Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro
305 310 315




ttc tac aca tgc gcc tgc ccc acg ggt gtg cag ctg cag gac aac ggc 1069
Phe Tyr Thr Cys Ala Cys Pro Thr Gly Val Gln Leu Gln Asp Asn Gly
320 325 330




agg acg tgt aag gca gga gcc gag gag gtg ctg ctg ctg gcc cgg cgg 1117
Arg Thr Cys Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg
335 340 345












acg gac cta cgg agg atc tcg ctg gac acg ccg gac ttc acc gac atc 1165
Thr Asp Leu Arg Arg Ile Ser Leu Asp Thr Pro Asp Phe Thr Asp Ile
350 355 360 365




gtg ctg cag gtg gac gac atc cgg cac gcc att gcc atc gac tac gac 1213
Val Leu Gln Val Asp Asp Ile Arg His Ala Ile Ala Ile Asp Tyr Asp
370 375 380




ccg cta gag ggc tat gtc tac tgg aca gat gac gag gtg cgg gcc atc 1261
Pro Leu Glu Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala Ile
385 390 395




cgc agg gcg tac ctg gac ggg tct ggg gcg cag acg ctg gtc aac acc 1309
Arg Arg Ala Tyr Leu Asp Gly Ser Gly Ala Gln Thr Leu Val Asn Thr
400 405 410
gag


ate 


aac 


gac 


ccc 


gat 


ggc 


ate 


gcg 


Glu 


He 


Asn 


Asp 


Pro 


Asp 


Gly 


He 


Ala 




415 










420 






etc 


tac 


tgg 


acc 


gac 


acg 


ggc 


acg 


gac 


Leu 


Tyr 


Trp 


Thr 


Asp 


Thr 


Gly 


Thr 


Asp 


430 










435 








aac 


ggc 


acc 


tec 


cgc 


aag 


ate 


ctg 


gtg 


Asn Gly 


Thr 


Ser 


Arg 


Lys 


He 


Leu 


Val 










450 










cga 


gee 


ate 


gca 


ctg 


cac 


ccc 


gtg 


atg 


Arg Ala 


He 


Ala 


Leu 


His 


Pro 


Val 


Met 








465 










470 


tgg 


gga 


gag 


aac 


cct 


aaa 


ate 


gag 


tgt 


Trp 


Gly 


Glu 


Asn 


Pro 


Lys 


He 


Glu 


Cys 






480 










485 




egg 


cgt 


gtg 


ctg 


gtc 


aat 


gee 


tec 


etc 


Arg Arg 


Val 


Leu 


Val 


Asn 


Ala 


Ser 


Leu 




495 










500 






ctg gac 


ctg 


cag 


gag 


ggg 


aag 


etc 


tac 


Leu 


Asp 


Leu 


Gin 


Glu 


Gly 


Lys 


Leu 


Tyr 


510 










515 








aag 


ate 


gag 


gtg 


ate 


aat 


gtt 


gat 


ggg 


Lys 


He 


Glu 


Val 


He 


Asn 


Val 


Asp 


Gly 










530 










gag 


gac 


aag 


etc 


ccg 


cac 


att 


ttc 


ggg 


Glu Asp 


Lys 


Leu 


Pro 


His 


He 


Phe 


Gly 








545 










550 


ate 


tac 


tgg 


act 


gac 


tgg 


cag 


cgc 


cgc 


He 


Tyr 


Trp 


Thr 


Asp 


Trp 


Gin 


Arg 


Arg 






560 










565 




gtc 


aag 


gee 


age 


egg 


gac 


gtc 


ate 


att 


Val 


Lys 


Ala 


Ser 


Arg 


Asp 


Val 


He 


He 




575 










580 






ggg 


etc 


aaa 


get 


gtg 


aat 


gtg 


gee 


aag 


Gly Leu 


Lys 


Ala 


Val 


Asn 


Val 


Ala 


Lys 


590 










595 








gcg 


gac 


agg 


aac 


ggg 


ggg 


tgc 


age 


cac 


Ala 


Asp 


Arg 


Asn 


Gly 


Gly 


Cys 


Ser 


His 










610 










gca 


acc 


egg 


tgt 


ggc 


tgc 


ccc 


ate 


ggc 


Ala 


Thr 


Arg 


Cys 


Gly 


Cys 


Pro 


He 


Gly 








625 










630 


aag 


acc 


tgc 


ate 


gtg 


cct 


gag 


gee 


ttc 


Lys 


Thr 


Cys 


lie 


Val 


Pro 


Glu 


Ala 


Phe 






640 










645 




gec 


ate 


cac 


agg 


ate 


tec 


etc 


gag 


acc 


Ala 


He 


His 


Arg 


He 


Ser 


Leu 


Glu 


Thr 




655 










660 






ccg 


etc 


acg 


ggc 


gtc 


aag 


gag 


gec 


tea 


Pro 


Leu 


Thr 


Gly 


Val 


Lys 


Glu 


Ala 


Ser 


670 










675 








aac 


aac 


cac 


ate 


tac 


tgg 


aca 


gac 


gtc 


Asn 


Asn 


His 


He 


Tyr 


Trp 


Thr 


Asp 


Val 



690 
gtc gac


tgg 


gtg 


gee 


cga 


aac 


1357 


Val Asp 


Trp 


Val 


Ala 


Arg 


Asn 






425 












cgc ate 


gag 


gtg 


acg 


cgc 


etc 


1405 


Arg He 


Glu 


Val 


Thr 


Arg 


Leu 




440 










445 




teg gag 


gac 


ctg 


gac 


gag 


ccc 


1453 


Ser Glu 


Asp 


Leu 


Asp 


Glu 


Pro 




455 








460 






ggc etc 


atg 


tac 


tgg 


aca 


gac 


1501 


Gly Leu 


Met 


Tyr 


Trp 


Thr 


Asp 










475 








gec aac 


ttg 


gat 


ggg 


cag 


gag 


1549 


Ala Asn 


Leu 


Asp 


Gly 


Gin 


Glu 








490 










ggg tgg 


ccc 


aac 


ggc 


ctg 


gec 


1597 


Gly Trp 


Pro 


Asn 


Gly 


Leu 


Ala 






505 












tgg gga 


gac 


gee 


aag 


aca 


gac 


1645 


Trp Gly 


Asp 


Ala 


Lys 


Thr 


Asp 




520 










525 




acg aag 


agg 


egg 


acc 


etc 


ctg 


1693 


Thr Lys 


Arg 


Arg 


Thr 


Leu 


Leu 




535 








540 






ttc acg 


ctg 


ctg 


ggg 


gac 


ttc 


1741 


Phe Thr 


Leu 


Leu 


Gly 


Asp 


Phe 










555 








age ate 


gag 


egg 


gtg 


cac 


aag 


1789 


Ser He 


Glu 


Arg 


Val 


His 


Lys 








570 










gac cag 


ctg 


ccc 


gac 


ctg 


atg 


1837 


Asp Gin 


Leu 


Pro 


Asp 


Leu 


Met 






585 












gtc gtc 


gga 


acc 


aac 


ccg 


tgt 


1885 


Val Val 


Gly 


Thr 


Asn 


Pro 


Cys 




600 










605 




ctg tgc 


ttc 


ttc 


aca 


ccc 


cac 


1933 


Leu Cys 


Phe 


Phe 


Thr 


Pro 


His 




615 








620 






ctg gag 


ctg 


ctg 


agt 


gac 


atg 


1981 


Leu Glu 


Leu 


Leu 


Ser 


Asp 


Met 










635 








ttg gtc 


ttc 


acc 


age 


aga 


gec 


2029 


Leu Val 


Phe 


Thr 


Ser 


Arg 


Ala 








650 










aat aac 


aac 


gac 


gtg 


gee 


ate 


2077 


Asn Asn 


Asn 


Asp 


Val 


Ala 


He 






665 












gee ctg 


gac 


ttt 


gat 


gtg 


tec 


2125 


Ala Leu 


Asp 


Phe 


Asp 


Val 


Ser 




680 










685 




age ctg 


aag 


acc 


ate 


age 


cgc 


2173 


Ser Leu 


Lys 


Thr 


He 


Ser 


Arg 




695 








700 
gcc


ttc 


atg 


aac 


ggg 


age 


teg 


gtg 


gag 


cac 


gtg 


gtg 


gag 


ttt 


ggc 


ctt 


2221 


Ala 


Phe 


Met 


Asn 


Gly 


Ser 


Ser 


Val 


Glu 


His 


Val 


Val 


Glu 


Phe 


Gly 


Leu 










705 










710 










715 








gac 


tac 


ccc 


gag 


ggc 


atg 


gcc 


gtt 


gac 


tgg 


atg 


ggc 


aag 


aac 


etc 


tac 


2269 


Asp 


Tyr 


Pro 


Glu 


Gly 


Met 


Ala 


Val 


Asp 


Trp 


Met 


Gly 


Lys 


Asn 


Leu 


Tyr 








720 










725 










730 










tgg 


gcc 


gac 


act 


ggg 


acc 


aac 


aga 


ate 


gaa 


gtg 


gcg 


egg 


ctg 


gac 


ggg 


2317 


Trp 


Ala 


Asp 


Thr 


Gly 


Thr 


Asn 


Arg 


He 


Glu 


Val 


Ala 


Arg 


Leu 


Asp 


Gly 






735 










740 










745 












cag 


ttc 


egg 


caa 


gtc 


etc 


gtg 


tgg 


agg 


gac 


ttg 


gac 


aac 


ccg 


agg 


teg 


2365 


Gin 


Phe 


Arg 


Gin 


Val 


Leu 


Val 


Trp 


Arg 


Asp 


Leu 


Asp 


Asn 


Pro 


Arg 


Ser 




750 










755 










760 










765 




ctg 


gcc 


ctg 


gat 


ccc 


acc 


aag 


ggc 


tac 


ate 


tac 


tgg 


acc 


gag 


tgg 


ggc 


2413 


Leu 


Ala 


Leu 


Asp 


Pro 


Thr 


Lys 


Gly 


Tyr 


He 


Tyr 


Trp 


Thr 


Glu 


Trp 


Gly 












770 










775 










780 






ggc 


aag 


ccg 


agg 


ate 


gtg 


egg 


gcc 


ttc 


atg 


gac 


ggg 


acc 


aac 


tgc 


atg 


2461 


Gly 


Lys 


Pro 


Arg 


He 


Val 


Arg 


Ala 


Phe 


Met 


Asp 


Gly 


Thr 


Asn 


Cys 


Met 










785 










790 










795 








acg 


ctg 


gtg 


gac 


aag 


gtg 


ggc 


egg 


gcc 


aac 


gac 


etc 


acc 


att 


gac 


tac 


2509 


Thr 


Leu 


Val 


Asp 


Lys 


Val 


Gly 


Arg 


Ala 


Asn 


Asp 


Leu 


Thr 


He 


Asp 


Tyr 








800 










805 










810 










get 


gac 


cag 


cgc 


etc 


tac 


tgg 


acc 


gac 


ctg 


gac 


acc 


aac 


atg 


ate 


gag 


2557 


Ala 


Asp 


Gin 


Arg 


Leu 


Tyr 


Trp 


Thr 


Asp 


Leu 


Asp 


Thr 


Asn 


Met 


He 


Glu 






815 










820 










825 












teg 


tec 


aac 


atg 


ctg 


ggt 


cag 


gag 


egg 


gtc 


gtg 


att 


gcc 


gac 


gat 


etc 


2605 


Ser 


Ser 


Asn 


Met 


Leu 


Gly 


Gin 


Glu 


Arg 


Val 


Val 


He 


Ala 


Asp 


Asp 


Leu 




830 










835 










84 0 










845 




ccg 


cac 


ccg 


ttc 


ggt 


ctg 


acg 


cag 


tac 


age 


gat 


tat 


ate 


tac 


tgg 


aca 


2653 


Pro 


His 


Pro 


Phe 


Gly 


Leu 


Thr 


Gin 


Tyr 


Ser 


Asp 


Tyr 


He 


Tyr 


Trp 


Thr 












850 










855 










860 






gac 


tgg 


aat 


ctg 


cac 


age 


att 


gag 


egg 


gcc 


gac 


aag 


act 


age 


ggc 


egg 


2701 


Asp 


Trp 


Asn 


Leu 


His 


Ser 


He 


Glu 


Arg 


Ala 


Asp 


Lys 


Thr 


Ser 


Gly 


Arg 










865 










870 










875 








aac 


cgc 


acc 


etc 


ate 


cag 


ggc 


cac 


ctg 


gac 


ttc 


gtg 


atg 


gac 


ate 


ctg 


2749 


Asn 


Arg 


Thr 


Leu 


He 


Gin 


Gly 


His 


Leu 


Asp 


Phe 


Val 


Met 


Asp 


He 


Leu ' 








880 










885 










890 










gtg 


ttc 


cac 


tec 


tec 


cgc 


cag 


gat 


ggc 


etc 


aat 


gac 


tgt 


atg 


cac 


aac 


2797 


Val 


Phe 


His 


Ser 


Ser 


Arg 


Gin 


Asp 


Gly 


Leu 


Asn 


Asp 


Cys 


Met 


His 


Asn 






895 










900 










905 












aac 


ggg 


cag 


tgt 


ggg 


cag 


ctg 


tgc 


ctt 


gcc 


ate 


ccc 


ggc 


ggc 


cac 


cgc 


2845 


Asn 


Gly 


Gin 


Cys 


Gly 


Gin 


Leu 


Cys 


Leu 


Ala 


He 


Pro 


Gly 


Gly 


His 


Arg 




910 










915 










92 0 










925 




tgc 


ggc 


tgc 


gcc 


tea 


cac 


tac 


acc 


ctg 


gac 


ccc 


age 


age 


cgc 


aac 


tgc 


2893 


Cys 


Gly 


Cys 


Ala 


Ser 


His 


Tyr 


Thr 


Leu 


Asp 


Pro 


Ser 


Ser 


Arg 


Asn 


Cys 












930 










935 










94 0 






age 


ccg 


ccc 


acc 


acc 


ttc 


ttg 


ctg 


ttc 


age 


cag 


aaa 


tct 


gcc 


ate 


agt 


2941 


Ser 


Pro 


Pro 


Thr 


Thr 


Phe 


Leu 


Leu 


Phe 


Ser 


Gin 


Lys 


Ser 


Ala 


He 


Ser 










945 










950 










955 








egg 


atg 


ate 


ccg 


gac 


gac 


cag 


cac 


age 


ccg 


gat 


etc 


ate 


ctg 


ccc 


ctg 


2989 


Arg 


Met 


He 


Pro 


Asp 


Asp 


Gin 


His 


Ser 


Pro 


Asp 


Leu 


He 


Leu 


Pro 


Leu 








960 










965 










970 










cat 


gga 


ctg 


agg 


aac 


gtc 


aaa 


gcc 


ate 


gac 


tat 


gac 


cca 


ctg 


gac 


aag 


3037 


His 


Gly 


Leu 


Arg 


Asn 


Val 


Lys 


Ala 


He 


Asp 


Tyr 


Asp 


Pro 


Leu 


Asp 


Lys 





975 980 985 
ttc atc tac tgg gtg gat ggg cgc cag aac atc aag cga gcc aag gac 3085
Phe Ile Tyr Trp Val Asp Gly Arg Gln Asn Ile Lys Arg Ala Lys Asp

990 995 1000 1005

gac ggg acc cag ccc ttt gtt ttg acc tct ctg agc caa ggc caa aac 3133
Asp Gly Thr Gln Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn

1010 1015 1020

cca gac agg cag ccc cac gac ctc agc atc gac atc tac agc cgg aca 3181
Pro Asp Arg Gln Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr

1025 1030 1035

ctg ttc tgg acg tgc gag gcc acc aat acc atc aac gtc cac agg ctg 3229
Leu Phe Trp Thr Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu

1040 1045 1050

agc ggg gaa gcc atg ggg gtg gtg ctg cgt ggg gac cgc gac aag ccc 3277
Ser Gly Glu Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro

1055 1060 1065

agg gcc atc gtc gtc aac gcg gag cga ggg tac ctg tac ttc acc aac 3325
Arg Ala Ile Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn

1070 1075 1080 1085

atg cag gac cgg gca gcc aag atc gaa cgc gca gcc ctg gac ggc acc 3373
Met Gln Asp Arg Ala Ala Lys Ile Glu Arg Ala Ala Leu Asp Gly Thr

1090 1095 1100

gag cgc gag gtc ctc ttc acc acc ggc ctc atc cgc cct gtg gcc ctg 3421
Glu Arg Glu Val Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu

1105 1110 1115

gtg gtg gac aac aca ctg ggc aag ctg ttc tgg gtg gac gcg gac ctg 3469
Val Val Asp Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu

1120 1125 1130

aag cgc att gag agc tgt gac ctg tca ggg gcc aac cgc ctg acc ctg 3517
Lys Arg Ile Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu

1135 1140 1145

gag gac gcc aac atc gtg cag cct ctg ggc ctg acc atc ctt ggc aag 3565
Glu Asp Ala Asn Ile Val Gln Pro Leu Gly Leu Thr Ile Leu Gly Lys

1150 1155 1160 1165

cat ctc tac tgg atc gac cgc cag cag cag atg atc gag cgt gtg gag 3613
His Leu Tyr Trp Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu

1170 1175 1180

aag acc acc ggg gac aag cgg act cgc atc cag ggc cgt gtc gcc cac 3661
Lys Thr Thr Gly Asp Lys Arg Thr Arg Ile Gln Gly Arg Val Ala His

1185 1190 1195

ctc act ggc atc cat gca gtg gag gaa gtc agc ctg gag gag ttc tca 3709
Leu Thr Gly Ile His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser

1200 1205 1210

gcc cac cca tgt gcc cgt gac aat ggt ggc tgc tcc cac atc tgt att 3757
Ala His Pro Cys Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile

1215 1220 1225

gcc aag ggt gat ggg aca cca cgg tgc tca tgc cca gtc cac ctc gtg 3805
Ala Lys Gly Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val

1230 1235 1240 1245

ctc ctg cag aac ctg ctg acc tgt gga gag ccg ccc acc tgc tcc ccg 3853
Leu Leu Gln Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro

1250 1255 1260

gac cag ttt gca tgt gcc aca ggg gag atc gac tgt atc ccc ggg gcc 3901
Asp Gln Phe Ala Cys Ala Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala

1265 1270 1275

tgg cgc tgt gac ggc ttt ccc gag tgc gat gac cag agc gac gag gag 3949
Trp Arg Cys Asp Gly Phe Pro Glu Cys Asp Asp Gln Ser Asp Glu Glu

1280 1285 1290

ggc tgc ccc gtg tgc tcc gcc gcc cag ttc ccc tgc gcg cgg ggt cag 3997
Gly Cys Pro Val Cys Ser Ala Ala Gln Phe Pro Cys Ala Arg Gly Gln

1295 1300 1305

tgt gtg gac ctg cgc ctg cgc tgc gac ggc gag gca gac tgt cag gac 4045
Cys Val Asp Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp

1310 1315 1320 1325

cgc tca gac gag gtg gac tgt gac gcc atc tgc ctg ccc aac cag ttc 4093
Arg Ser Asp Glu Val Asp Cys Asp Ala Ile Cys Leu Pro Asn Gln Phe

1330 1335 1340

cgg tgt gcg agc ggc cag tgt gtc ctc atc aaa cag cag tgc gac tcc 4141
Arg Cys Ala Ser Gly Gln Cys Val Leu Ile Lys Gln Gln Cys Asp Ser

1345 1350 1355

ttc ccc gac tgt atc gac ggc tcc gac gag ctc atg tgt gaa atc acc 4189
Phe Pro Asp Cys Ile Asp Gly Ser Asp Glu Leu Met Cys Glu Ile Thr

1360 1365 1370

aag ccg ccc tca gac gac agc ccg gcc cac agc agt gcc atc ggg ccc 4237
Lys Pro Pro Ser Asp Asp Ser Pro Ala His Ser Ser Ala Ile Gly Pro

1375 1380 1385

gtc att ggc atc atc ctc tct ctc ttc gtc atg ggt ggt gtc tat ttt 4285
Val Ile Gly Ile Ile Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe

1390 1395 1400 1405

gtg tgc cag cgc gtg gtg tgc cag cgc tat gcg ggg gcc aac ggg ccc 4333
Val Cys Gln Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro

1410 1415 1420

ttc ccg cac gag tat gtc agc ggg acc ccg cac gtg ccc ctc aat ttc 4381
Phe Pro His Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe

1425 1430 1435

ata gcc ccg ggc ggt tcc cag cat ggc ccc ttc aca ggc atc gca tgc 4429
Ile Ala Pro Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys

1440 1445 1450

gga aag tcc atg atg agc tcc gtg agc ctg atg ggg ggc cgg ggc ggg 4477
Gly Lys Ser Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly

1455 1460 1465

gtg ccc ctc tac gac cgg aac cac gtc aca ggg gcc tcg tcc agc agc 4525
Val Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser

1470 1475 1480 1485

tcg tcc agc acg aag gcc acg ctg tac ccg ccg atc ctg aac ccg ccg 4573
Ser Ser Ser Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro

1490 1495 1500

ccc tcc ccg gcc acg gac ccc tcc ctg tac aac atg gac atg ttc tac 4621
Pro Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr

1505 1510 1515

tct tca aac att ccg gcc act gcg aga ccg tac agg ccc tac atc att 4669
Ser Ser Asn Ile Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr Ile Ile

1520 1525 1530

cga gga atg gcg ccc ccg acg acg ccc tgc agc acc gac gtg tgt gac 4717
Arg Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp

1535 1540 1545

agc gac tac agc gcc agc cgc tgg aag gcc agc aag tac tac ctg gat 4765
Ser Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp

1550 1555 1560 1565

ttg aac tcg gac tca gac ccc tat cca ccc cca ccc acg ccc cac agc 4813
Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser

1570 1575 1580

cag tac ctg tcg gcg gag gac agc tgc ccg ccc tcg ccc gcc acc gag 4861
Gln Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu

1585 1590 1595

agg agc tac ttc cat ctc ttc ccg ccc cct ccg tcc ccc tgc acg gac 4909
Arg Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp

1600 1605 1610

tca tcc tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt 4965
Ser Ser 
1615 

aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt 5025 
taaaaacatg agaaatgtga actgtgatgg ggtgggcagg gctgggagaa ctttgtacag 5085 
tggagaaata tttataaact taattttgta aaaca 5120 



<210> 3 

<211> 1615 

<212> PRT 

<213> Homo sapiens 



<400> 3 

Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu Leu Leu Leu 



1 5 10 15

Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser

20 25 30

Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala

35 40 45

Gly Gly Val Lys Leu Glu Ser Thr Ile Val Val Ser Gly Leu Glu Asp

50 55 60

Ala Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val Tyr Trp Thr

65 70 75 80

Asp Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn Gln Thr Gly

85 90 95

Ala Ala Val Gln Asn Val Val Ile Ser Gly Leu Val Ser Pro Asp Gly

100 105 110

Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu

115 120 125

Thr Asn Arg Ile Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val

130 135 140

Leu Phe Trp Gln Asp Leu Asp Gln Pro Lys Ala Ile Ala Leu Asp Pro

145 150 155 160

Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Gly Glu Thr Pro Arg Ile

165 170 175

Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys Ile Ile Val Asp Ser

180 185 190

Asp Ile Tyr Trp Pro Asn Gly Leu Thr Ile Asp Leu Glu Glu Gln Lys

195 200 205

Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe Ile His Arg Ala Asn Leu

210 215 220

Asp Gly Ser Phe Arg Gln Lys Val Val Glu Gly Ser Leu Thr His Pro

225 230 235 240

Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Trp Gln

245 250 255

Thr Arg Ser Ile His Ala Cys Asn Lys Arg Thr Gly Gly Lys Arg Lys

260 265 270

Glu Ile Leu Ser Ala Leu Tyr Ser Pro Met Asp Ile Gln Val Leu Ser

275 280 285

Gln Glu Arg Gln Pro Phe Phe His Thr Arg Cys Glu Glu Asp Asn Gly

290 295 300

Gly Trp Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro Phe Tyr Thr

305 310 315 320

Cys Ala Cys Pro Thr Gly Val Gln Met Gln Asp Asn Gly Arg Thr Cys

325 330 335

Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg Thr Asp Leu

340 345 350

Arg Arg Ile Ser Leu Asp Thr Pro Asp Phe Thr Asp Ile Val Leu Gln

355 360 365

Val Asp Asp Ile Arg His Ala Ile Ala Ile Asp Tyr Asp Pro Leu Glu

370 375 380

Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala Ile Arg Arg Ala

385 390 395 400

Tyr Leu Asp Gly Ser Gly Ala Gln Thr Leu Val Asn Thr Glu Ile Asn

405 410 415

Asp Pro Asp Gly Ile Ala Val Asp Trp Val Ala Arg Asn Leu Tyr Trp

420 425 430

Thr Asp Thr Gly Thr Asp Arg Ile Glu Val Thr Arg Leu Asn Gly Thr

435 440 445

Ser Arg Lys Ile Leu Val Ser Glu Asp Leu Asp Glu Pro Arg Ala Ile

450 455 460

Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp Trp Gly Glu

465 470 475 480

Asn Pro Lys Ile Glu Cys Ala Asn Leu Asp Gly Gln Glu Arg Arg Val

485 490 495

Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala Leu Asp Leu

500 505 510

Gln Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp Lys Ile Glu

515 520 525

Val Ile Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu Asp Lys

530 535 540

Leu Pro His Ile Phe Gly Phe Thr Leu Leu Gly Asp Phe Ile Tyr Trp

545 550 555 560

Thr Asp Trp Gln Arg Arg Ser Ile Glu Arg Val His Lys Val Lys Ala

565 570 575

Ser Arg Asp Val Ile Ile Asp Gln Leu Pro Asp Leu Met Gly Leu Lys

580 585 590

Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys Ala Asp Arg

595 600 605

Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His Ala Thr Arg

610 615 620

Cys Gly Cys Pro Ile Gly Leu Glu Leu Leu Ser Asp Met Lys Thr Cys

625 630 635 640

Ile Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala Ala Ile His

645 650 655

Arg Ile Ser Leu Glu Thr Asn Asn Asn Asp Val Ala Ile Pro Leu Thr

660 665 670

Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser Asn Asn His

675 680 685

Ile Tyr Trp Thr Asp Val Ser Leu Lys Asn Ile Ser Arg Ala Phe Met

690 695 700

Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu Asp Tyr Pro

705 710 715 720

Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr Trp Ala Asp

725 730 735

Thr Gly Thr Asn Arg Ile Glu Val Ala Arg Leu Asp Gly Gln Phe Arg

740 745 750

Gln Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser Leu Ala Leu

755 760 765

Asp Pro Thr Lys Gly Tyr Ile Tyr Trp Thr Glu Trp Gly Gly Lys Pro

770 775 780

Arg Ile Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met Thr Leu Val

785 790 795 800

Asp Lys Val Gly Arg Ala Asn Asp Leu Thr Ile Asp Tyr Ala Asp Gln

805 810 815

Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met Ile Glu Ser Ser Asn

820 825 830

Met Leu Gly Gln Glu Arg Val Val Ile Ala Asp Asp Leu Pro His Pro

835 840 845

Phe Gly Leu Thr Gln Tyr Ser Asp Tyr Ile Tyr Trp Thr Asp Trp Asn

850 855 860

Leu His Ser Ile Glu Arg Ala Asp Lys Thr Ser Gly Arg Asn Arg Thr

865 870 875 880

Leu Ile Gln Gly His Leu Asp Phe Val Met Asp Ile Leu Val Phe His

885 890 895

Ser Ser Arg Gln Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Gln

900 905 910

Cys Gly Gln Leu Cys Leu Ala Ile Pro Gly Gly His Arg Cys Gly Cys

915 920 925

Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys Ser Pro Pro

930 935 940

Thr Thr Phe Leu Leu Phe Ser Gln Lys Ser Ala Ile Ser Arg Met Ile

945 950 955 960

Pro Asp Asp Gln His Ser Pro Asp Leu Ile Leu Pro Leu His Gly Leu

965 970 975

Arg Asn Val Lys Ala Ile Asp Tyr Asp Pro Leu Asp Lys Phe Ile Tyr

980 985 990

Trp Val Asp Gly Arg Gln Asn Ile Lys Arg Ala Lys Asp Asp Gly Thr

995 1000 1005

Gln Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn Pro Asp Arg

1010 1015 1020

Gln Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr Leu Phe Trp

1025 1030 1035 1040

Thr Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu Ser Gly Glu

1045 1050 1055

Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala Ile

1060 1065 1070

Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gln Asp

1075 1080 1085

Arg Ala Ala Lys Ile Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu

1090 1095 1100

Val Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu Val Val Asp

1105 1110 1115 1120

Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg Ile

1125 1130 1135

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala

1140 1145 1150

Asn Ile Val Gln Pro Leu Gly Leu Thr Ile Leu Gly Lys His Leu Tyr

1155 1160 1165

Trp Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu Lys Thr Thr

1170 1175 1180

Gly Asp Lys Arg Thr Arg Ile Gln Gly Arg Val Ala His Leu Thr Gly
1185 1190 1195 1200

Ile His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro

1205 1210 1215

Cys Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile Ala Lys Gly

1220 1225 1230

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gln

1235 1240 1245

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gln Phe

1250 1255 1260

Ala Cys Ala Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala Trp Arg Cys
1265 1270 1275 1280

Asp Gly Phe Pro Glu Cys Asp Asp Gln Ser Asp Glu Glu Gly Cys Pro

1285 1290 1295

Val Cys Ser Ala Ala Gln Phe Pro Cys Ala Arg Gly Gln Cys Val Asp

1300 1305 1310

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp Arg Ser Asp

1315 1320 1325

Glu Val Asp Cys Asp Ala Ile Cys Leu Pro Asn Gln Phe Arg Cys Ala

1330 1335 1340

Ser Gly Gln Cys Val Leu Ile Lys Gln Gln Cys Asp Ser Phe Pro Asp
1345 1350 1355 1360

Cys Ile Asp Gly Ser Asp Glu Leu Met Cys Glu Ile Thr Lys Pro Pro

1365 1370 1375

Ser Asp Asp Ser Pro Ala His Ser Ser Ala Ile Gly Pro Val Ile Gly

1380 1385 1390

Ile Ile Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gln

1395 1400 1405

Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His

1410 1415 1420

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe Ile Ala Pro
1425 1430 1435 1440

Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys Gly Lys Ser

1445 1450 1455

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu

1460 1465 1470

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser

1475 1480 1485

Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro Pro Ser Pro

1490 1495 1500

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn
1505 1510 1515 1520

Ile Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr Ile Ile Arg Gly Met

1525 1530 1535

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr
1540 1545 1550




WO 02/092000 



PCT/US02/14877 



Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser 

1555 1560 1565

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gln Tyr Leu

1570 1575 1580

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr
1585 1590 1595 1600

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 
1605 1610 1615 



<210> 4 
<211> 1615 
<212> PRT 

<213> Homo sapiens 



<400> 4 



Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu Leu Leu Leu

1 5 10 15

Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser

20 25 30

Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala

35 40 45

Gly Gly Val Lys Leu Glu Ser Thr Ile Val Val Ser Gly Leu Glu Asp

50 55 60

Ala Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val Tyr Trp Thr

65 70 75 80

Asp Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn Gln Thr Gly

85 90 95

Ala Ala Val Gln Asn Val Val Ile Ser Gly Leu Val Ser Pro Asp Gly

100 105 110

Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu

115 120 125

Thr Asn Arg Ile Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val

130 135 140

Leu Phe Trp Gln Asp Leu Asp Gln Pro Lys Ala Ile Ala Leu Asp Pro

145 150 155 160

Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Val Glu Thr Pro Arg Ile

165 170 175

Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys Ile Ile Val Asp Ser

180 185 190

Asp Ile Tyr Trp Pro Asn Gly Leu Thr Ile Asp Leu Glu Glu Gln Lys

195 200 205

Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe Ile His Arg Ala Asn Leu

210 215 220

Asp Gly Ser Phe Arg Gln Lys Val Val Glu Gly Ser Leu Thr His Pro

225 230 235 240

Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Trp Gln

245 250 255

Thr Arg Ser Ile His Ala Cys Asn Lys Arg Thr Gly Gly Lys Arg Lys

260 265 270

Glu Ile Leu Ser Ala Leu Tyr Ser Pro Met Asp Ile Gln Val Leu Ser

275 280 285

Gln Glu Arg Gln Pro Phe Phe His Thr Arg Cys Glu Glu Asp Asn Gly

290 295 300

Gly Trp


Ser 


His 


Leu 


Cvs 


Leu 


Leu 


Ser 


305 








310 








Cys Ala 


Cys 


Pro 


Thr 


Gly 


Val 


Gin 


Met 








32 5 










Lys Ala 


Gly 


Ala 


Glu 


Glu 


Val 


Leu 


Leu 






340 










345 


Arg Arg 


lie 


Ser 


Leu 


Asp 


Thr 


Pro 


Asp 




355 










360 




Val Asp 


Asp 


lie 


Arq 


His 


Ala 


He 


Ala 


370 










375 






Gly Tyr 

J 2 


Val 


Tyr 


Trp 


Thr 


Asp 


Asp 


Glu 


385 








390 








Tyr Leu 


Asp 


Gly 


Ser 


Gly 


Ala 


Gin 


Thr 








405 










Asd Pro 


Asd 


Glv 


lie 


Ala 


Val 


Asp 


Tro 






420 










425 


Thr* Asd 


Thr 


Glv 


Thr 


Asp 


Arg 


He 


Glu 




435 










440 




Ser Arg 


Lvs 


lie 


Leu 


Val 


Ser 


Glu 


Asp 


450 










455 






Ala Leu 


His 


Pro 


Val 


Met 


Gly 


Leu 


Met 


465 








470 








Asn Pro 


Lys 


lie 


Glu 


Cys 


Ala 


Asn 


Leu 








485 










Leu Val 


Asn 


Ala 


Ser 


Leu 


Gly 


Trp 


Pro 






500 










505 


Gin Glu 


Gly 


Lys 


Leu 




Trn 

hr 


Glv 


Asp 














520 




Val lie 


Asn 


Val 


Asp 


Glv 


Thr 


Lvs 


Arq 


53 0 










535 






Leu Pro 


His 


lie 


Phe 


Gly 


Phe 


Thr 


Leu 


545 








550 








Thr Asd 


Tro 


Gin 


Arg 


Arcr 


Ser 


He 


Glu 








565 










Ser Arg 


Asp 


Val 


lie 


lie 


Asp 


Gin 


Leu 






580 










585 


Ala Val 


Asn 


Val 


Ala 


Lvs 


Val 


Val 


Gly 




_> j j 










600 




7\qn Crl V 

rlDii. UXy 




Cys 


Ser 


His 


Leu 


Cys 


Phe 


610 










615 






Cys Gly 


Cys 


Pro 


lie 


Glv 


Leu 


Glu 


Leu 


625 








630 








lie Val 


Pro 


Glu 


Ala 


Phe 


Leu 


Val 


Phe 








645 










Arg lie 


Ser 


Leu 


Glu 


Thr 


Asn 


Asn 


Asn 






660 










665 


Gly Val 


Lys 


Glu 


Ala 


Ser 


Ala 


Leu 


Asp 




675 










680 




lie Tyr 


Trp 


Thr 


Asp 


Val 


Ser 


Leu 


Lys 


690 










695 






Asn Gly 


Ser 


Ser 


Val 


Glu 


His 


Val 


Val 


705 








710 








Glu Gly 


Met 


Ala 


Val 


Asp 


Trp 


Met 


Gly 



725 
Pro 


Ser 


Glu 


Pro 


Phe 


Tyr 


Thr 




315 










320 


Gin 


Asd 


Asn 


Glv 


Arg 


Thr 


Cvs 


330 










335 




Leu 


Ala 


Arq 


Arq 


Thr 


Asp 


Leu 










350 






Phe 


Thr 


Asp 


lie 


Val 


Leu 


Gin 








365 








He 


Asp 


Tvr 


Asp 


Pro 


Leu 


Glu 






380 










Val 


Arg 


Ala 


He 


Arg 


Arg 


Ala 




395 










400 


Leu 


Val 


Asn 


Thr 


Glu 


He 


Asn 


410 










415 




Val 


Ala 


Arq 


Asn 


Leu 


Tvr 


Trp 










430 






Val 


Thr 


Arq 


Leu 


Asn 


Gly 


Thr 








445 








Leu 


Asp 


Glu 


Pro 


Arg 


Ala 


He 






460 










Tyr 


Trp 

IT 


Thr 


Asp 


Trp 


Gly 


Glu 




475 










480 


Asp 


Gly 


Gin 


Glu 


Arg 


Arq 
-** 


Val 


490 










495 




Asn 


Gly 


Leu 


Ala 


Leu 


Asp 


Leu 










510 






Ala 


Lvs 


Thr 


Asp 


Lvs 


He 


Glu 








525 








Arq 


Thr 


Leu 


Leu 


Glu 


Asp 


Lys 






540 










Leu 


Gly 


Asp 


Phe 


lie 


Tyr 


Trp 

IT 




555 










560 


Arq 


Val 


His 


Lys 

2 


Val 


Lys 


Ala 


570 










575 




Pro 


Asp 


Leu 


Met 


Gly 


Leu 


LVS 










590 






Thr 


Asn 


Pro 


Cys 

2 


Ala 


Asp 


Arq 








605 








Phe 


Thr 


Pro 


His 


Ala 


Thr 


Ara 






620 










Leu 


Ser 


Asp 


Met 


Lvs 


Thr 


Cys 




635 










640 


Thr 


Ser 


Arq 


Ala 


Ala 


He 


His 


650 










655 




Asp 


Val 


Ala 


He 


Pro 


Leu 


Thr 










670 






Phe 


Asp 


Val 


Ser 


Asn 


Asn 


His 








685 








Asn 


He 


Ser 


Arq 


Ala 


Phe 


Met 






700 










Glu 


Phe 


Gly 


Leu 


Asp 


Tyr 


Pro 




715 










720 


Lys 


Asn 


Leu' 


Tyr 


Trp 


Ala 


Asp 


730 










735 
Thr Gly Thr Asn Arg Ile Glu Val Ala Arg Leu Asp Gly Gln Phe Arg 

740 745 750 

Gln Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser Leu Ala Leu 

755 760 765 

Asp Pro Thr Lys Gly Tyr Ile Tyr Trp Thr Glu Trp Gly Gly Lys Pro 

770 775 780 

Arg Ile Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met Thr Leu Val 
785 790 795 800 

Asp Lys Val Gly Arg Ala Asn Asp Leu Thr Ile Asp Tyr Ala Asp Gln 

805 810 815 

Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met Ile Glu Ser Ser Asn 

820 825 830 

Met Leu Gly Gln Glu Arg Val Val Ile Ala Asp Asp Leu Pro His Pro 

835 840 845 

Phe Gly Leu Thr Gln Tyr Ser Asp Tyr Ile Tyr Trp Thr Asp Trp Asn 

850 855 860 

Leu His Ser Ile Glu Arg Ala Asp Lys Thr Ser Gly Arg Asn Arg Thr 
865 870 875 880 

Leu Ile Gln Gly His Leu Asp Phe Val Met Asp Ile Leu Val Phe His 

885 890 895 

Ser Ser Arg Gln Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Gln 

900 905 910 

Cys Gly Gln Leu Cys Leu Ala Ile Pro Gly Gly His Arg Cys Gly Cys 

915 920 925 

Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys Ser Pro Pro 

930 935 940 

Thr Thr Phe Leu Leu Phe Ser Gln Lys Ser Ala Ile Ser Arg Met Ile 
945 950 955 960 

Pro Asp Asp Gln His Ser Pro Asp Leu Ile Leu Pro Leu His Gly Leu 

965 970 975 

Arg Asn Val Lys Ala Ile Asp Tyr Asp Pro Leu Asp Lys Phe Ile Tyr 

980 985 990 

Trp Val Asp Gly Arg Gln Asn Ile Lys Arg Ala Lys Asp Asp Gly Thr 

995 1000 1005 

Gln Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn Pro Asp Arg 

1010 1015 1020 

Gln Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr Leu Phe Trp 
1025 1030 1035 1040 

Thr Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu Ser Gly Glu 

1045 1050 1055 

Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala Ile 

1060 1065 1070 

Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gln Asp 

1075 1080 1085 

Arg Ala Ala Lys Ile Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu 

1090 1095 1100 

Val Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu Val Val Asp 
1105 1110 1115 1120 

Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg Ile 

1125 1130 1135 

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala 

1140 1145 1150 

Asn Ile Val Gln Pro Leu Gly Leu Thr Ile Leu Gly Lys His Leu Tyr 
1155 1160 1165 

Trp Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu Lys Thr Thr 

1170 1175 1180 

Gly Asp Lys Arg Thr Arg Ile Gln Gly Arg Val Ala His Leu Thr Gly 
1185 1190 1195 1200 

Ile His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro 

1205 1210 1215 

Cys Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile Ala Lys Gly 

1220 1225 1230 

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gln 

1235 1240 1245 

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gln Phe 

1250 1255 1260 

Ala Cys Ala Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala Trp Arg Cys 
1265 1270 1275 1280 

Asp Gly Phe Pro Glu Cys Asp Asp Gln Ser Asp Glu Glu Gly Cys Pro 

1285 1290 1295 

Val Cys Ser Ala Ala Gln Phe Pro Cys Ala Arg Gly Gln Cys Val Asp 

1300 1305 1310 

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp Arg Ser Asp 

1315 1320 1325 

Glu Val Asp Cys Asp Ala Ile Cys Leu Pro Asn Gln Phe Arg Cys Ala 

1330 1335 1340 

Ser Gly Gln Cys Val Leu Ile Lys Gln Gln Cys Asp Ser Phe Pro Asp 
1345 1350 1355 1360 

Cys Ile Asp Gly Ser Asp Glu Leu Met Cys Glu Ile Thr Lys Pro Pro 

1365 1370 1375 

Ser Asp Asp Ser Pro Ala His Ser Ser Ala Ile Gly Pro Val Ile Gly 

1380 1385 1390 

Ile Ile Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gln 

1395 1400 1405 

Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His 

1410 1415 1420 

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe Ile Ala Pro 
1425 1430 1435 1440 

Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys Gly Lys Ser 

1445 1450 1455 

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu 

1460 1465 1470 

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser 

1475 1480 1485 

Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro Pro Ser Pro 

1490 1495 1500 

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn 
1505 1510 1515 1520 

Ile Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr Ile Ile Arg Gly Met 

1525 1530 1535 

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr 

1540 1545 1550 

Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser 

1555 1560 1565 

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gln Tyr Leu 

1570 1575 1580 

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr 
1585 1590 1595 1600 

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 
1605 1610 1615 



<210> 5 

<211> 3096 

<212> DNA 

<213> Homo sapiens 

<400> 5 

catcttctca cacgatctct cgcttcgcac tccttccttt gattggtttt caccatttac 60 

tcagacgacg gtccttcttc gatctttgca cattcttcta tcatctacta ccttcatacc 120 

cagctccgtc ccctaatatt catgcgcgga tggcccattc cgtggtgaaa attcccttct 180 

actctgctaa tctgctgttc tctctccctc ccgtcgggfct ctgctcctgc cacgttctcc 240 

cctctcccca ccaaaggctg ggttttcttt gtcagggctc ctttcccctt tggaagaagg 300 

ggggctgtat ggccttggtg cgaggccctc cagtgacagg atcccccatc acccagagtt 360 

ccacaggccc tggtagggag gagggggagc agaagaggag gtgccatctt tgcctgctgg 420 

ggaagggcag gggccaccca cacagagctc tcccatttgc tgtggaccct ggggccactg 480 

cccagttcct tccaaaggaa agccagctcc ccaggtggtg ggagagtgat atggcttcct 540 

cttaaactta gggaattgag tgtgtggttg cttctaagtg ccttagaagc cgggagcggc 600 

tcctggaaag agcctgcctg ccacagcggg ccttaccctg gctgtgccca cagatgtccc 660 

tggggcctgc cgctcctgcc cggctctcct ggcctccccc ggtgtgggtt gggaaaagca 720 

cagcaaatta aaaaacacct ccatctctgg cctttgaaga atgcatctga acagccgaga 780 

gtgtaaaccg tggtgaaatg tggtctttcc agtttgggga gaagcagggc agagctgggg 840 

cttttgtacc cagggtttcc aagagctcct gcctccctcg gctgggctgg ccagggcccc 900 

ccgctgggac ctccagctgt aatagggaag gttttactgg gttgctggcc actgtggact 960 

gcccctaagg gcaggtatgc ctgcctttac ccgggttccc ctcctgcctg gaagatacag 1020 

cccatgggag gcctgttgtc tgtgggatcc tccagcatca gagacactgg ggccagcgtc 1080 

tgcctggtga ggtgcaggcc tggcaggccc ggtcccccac ctgcttgagc acccacggtg 1140 

gtgggggctc gctgcctccc gagacaatct atgtcattgt tgtccaagga agctaattta 1200 

gagtagaaag ttccgtgtcc agtcccactc tgtgcgtgtg ttagcagggg actctcgggc 1260 

cggagctggg tccaccctgg tagggggact tcatggggcc tgggcgacag cactgtgtat 1320 

ttgtgtgtgt gtgtgtttgt gtgtgtgtgt gtctgaggag gtggaccagt ttctcaaaag 1380 

gcctgtgacc ccaagaacca aggaatttca gcctgggtgg atcacacctt cactggtgag 1440 

tgggacaagc tgggggccct cgccacagga gcagccaggg catggggcac agttggcctc 1500 

attcacaaaa tgggagtata agtgatccct gctctggcgg ccaggacgat gagtgggaac 1560 

acaccgtgtg ggggctgcct ggcctgggtg tgccgcgggt gtccttgttg gtgatggttc 1620 

cacctgcttg tgccaccagt gccctctggg tctcacacac aactctcttc ccagcgaagg 1680 

cccctcctgc cctcaggcct cagtgctgct tccgtctcgg aaggccccag gagctcctgc 1740 

atcctgggcg tgattcctgt gtgcctgcag accccctcgc ggctgccatc tcatcctttg 1800 

gtgcacctgt tggccagacc tcctggtagc gggtgctgca ctcccctgaa tgtgccgggg 1860 

cctgggggca gggacctggg ctcctccctc actgagtgga gggaactcag tgtcttggag 1920 

ttggggtgcc tgcaggctgg gtggtgcagg tgaaatgcag acctctcagc tggtgttcca 1980 

gagcagctgc cttcccccgc ccgagggact tcacccgcag cccagtcagg ggtggcgcct 2040 

gggtgcatcg cccgcaggct gggtaggggt ggagcctggg tggccctgcc tgtgagctgc 2100 

atagttgtcg cctttgaccc tgagttttct tcgttatctg tttggacctg tttggggcag 2160 

gcaggggatg agatctgaag ataaatgcct tagctgtgac catctccttt tgtgagaggt 2220 

caatgtccag ttccgctgca gttataacat cccatttttt gatttctttt tattttttcc 2280 

tttttctttt tgagatggag tctcgctctg tcacccaggc tggagtgcaa tggggtgacc 2340 

tcagctcact gcaacctcca cttctcgggt tcaagtgatt ctcctgcctc agcctcctga 2400 

ctagcagggg ttacaggcgt gagccaccac gcccagctaa tttttgtatt tttagtagag 2460 

gcaaggtttc gtcatgttgg ccaggctggt ctcaaactcc tggccttaag tgatctgccc 2520 

gcctcggcct cccaaagtgc tgagatgaca ggtgtgagcc accgtgcccg gcccagaact 2580 

ctttaattcc cacctgaaac ttgccgcctt aagcaggtcc ccagtctccc tcccctagtc 2640 

cctggtccca ccattctgct ttctgtctca atgaatttgc ctaccgtaag tacctcatat 2700 

aaattgaatc ataaagtatt tgtcttttta tatctggctt atttcactta gcataacatt 2760 

cttaagtttc atccatgttg tagcatgtgt cagaatctct ctcttttttt tttttttttt 2820 

tttttttttt ttttgcagac agagtctcgc tctgtcatct agactggagt tcagtggcac 2880 

gatctcggtt cactgcaaca tctgcctcct gggtccaagc aattctcctg cctcagcctc 2940 

cttagcagct ggaactacag gcgcgtgcca ccatgccttg ctaatttttg tatttttatg 3000 

tggaggcagg gtttcaccat cttggccagg ctggtctcga attcctggtc ttcaccacgg 3060 

gggcccgaag gacccgggca aagcgtggag gggagg 3096 

<210> 6 

<211> 26928 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 

<222> (12044), (12489), (26433), (26434), (26435), (26436), (26439), (26441) 
<223> The identities of the nucleotides at the above locations are unknown. 

<400> 6 

gaagaccaag ggcacacagc gaggcagttt cagggcgggc agcctggggc cccacggggc 60 

ggccccggac acttgttctc acctgtggag ggcagagaag ggaacaggga gagaagtggc 120 

cggctgggag tggaggtggg tttgaggttt tactgtaaac taaatgtgta ccctctacct 180 

tagttatgaa ttatgagaca cgaagactgc gaaacagaca cactcctcta aaagtgcctc 240 

taggctgaca gggagaaagt cccgccaggc tcccagacgc cacctttgag tccttcaaca 300 

agcccgccag ggcctcttgc ccaccggtgt cagctcagcc actgaaccct ccaggaagaa 360 

gacgtgctgg taggagaaga atctcaccca ggcacagcct ggaaggggca cagaaggggc 420 

tccggaacca gcaagcccaa gttggaactc ccagtctgct actttctaga acgactgtgc 480 

ccttggcggg tctaagtaga acctctccgc gcactctttc ctcctttgta aagtggggac 540 

agcaatggcc accttgcagg ttcagagagg gcttgcagta cctcacagaa ctgagtgccc 600 

gtgaacgtgt gtgttcctcc agatttgtga cagctttgcc aggctggagt caggctgaac 660 

gcctctgccc tcatggggtt tatattctag gaagaccaac aaaaacaaga agacggaaaa 720 

ttaaaacaac aaaagcccca ttgacaggcc gtgaagaatg ccatgaaaaa tgaatggcgt 780 

tgtgctgcag tctttgggga aacgggctta cggaaagaag gacacttgag ctgctaccaa 840 

tgagcagccg tccggtggga gggcagttca ggaagagcag acatccactg aggaggcgct 900 

ggggcagagg gcagcctggt cgctggattc gggggaggaa ccacatcagg ccatgagctg 960 

gagctggtgg tagaatgtac aggagaggcc agccagggcc agctcatgtc agacctcaag 1020 

cggggaagat gaatcgagaa tgcaccccac gagcaatggg aagccagtct acgatttaag 1080 

cagcaaaaat attttccctt cttccaccct gcatccagct ctaccagcac agcctggggt 1140 

tctattttca agatagaata gacccagact cccagctctt cttacacttc tactactgcc 1200 

acctgtcacc cactcatgcg tccccacttg cagcctcgac ccccttccac ctgatctcat 1260 

ggcagccagg gaagctccag ggctcgtgag ggctgccatc tcaggaaaga agcaaaagcc 1320 

ttcggcacct gcagggcctg ctccaaccac acttcttcct tgacctctca gcttccttag 1380 

ccactccctt cccacatctc accctgctcc agccacagtg gtgtctctgt gggttctcaa 1440 

acacaccagg tgcactcctg cctcagggcc tttgtgcttg ctgttctctg ctgggactct 1500 

tttttttttt tttttttttg agacagggtc tcactctgtg gcccaggctg gagtgtagtg 1560 

gtgtgatcgt agctcattgc aacctcaaac tcctgggctc aagcaatcct cccacctcag 1620 

cctctcaagt agttagcttt tgttgttttg ttttgagatg ggatctcact ctgttgccca 1680 

ggctggagtg cagtggggca atcttggctc accacaacct ctgcctccca ggctcaagca 1740 

attctcctgc ctcagcctcc caagtagctg ggattacagg catgtgccac cacgcccagc 1800 

ttatttttgt atttttagta gagacagggt ttcaccatgt tggtctggct ggtcttgaac 1860 

tcctggcctc agatgatcca cctgcctcgg cctcccaaag tgctgggatg acaggcatga 1920 

gcctgtctct agtagttagg actacagaga ggggccatca tgcctggtga tcctcccacc 1980 

ttttctgctc caactctttc accccactta gcctcgtggc tcactctctt acctcttcag 2040 

ctcctcagtc aggcctgagg acccctgttg aaaattgcaa accacacccc ccaccaccac 2100 

cacccactat tgccagcact ttctactcca tttctctgct ttacttttct cctttgtact 2160 

catcaccacc tgactcatta catgtttacg tatctttctt ctctccacta gcatggaagc 2220 

tccaggagag cagagagtgt agttttattc cctgatgtgt ttcctgtgcc cgtaccaggg 2280 

cctagcacac agtaggtgct cagtaaatgt gtgttggatg aacaaataca gtgaaaggat 2340 

ctgatctaca tttataaaga aggcactctg gctgctgagt ggggatgaga ctgtcaggag 2400 

gaaagaggcc cctgtggggg cctggccagc aggtgggtac aatggtagca gccaggagag 2460 

agggcctctt ggactcaagt ggatggggcc tgctcagggc tccggccaca ggaacaaagg 2520 

gaagggggcc caggatggcc tgtcatagag gacacattac aactggccca aagttcaagt 2580 

caggtttcta aatttgggaa gggatacaga aaaactaaag actctactgg acagtcagtt 2640 

attgaaatga ttacatagaa aatgtaccaa gaattaaaaa aaaaaaaaaa aagcattatg 2700 

aaggggccac cagagactcc cagagaggaa agggactatg ggctggatgc ggtgactcac 2760 

acctataatc ccagcacttt gggaggccga ggagggtgga tcacgaggtc aggagttcaa 2820 

aaccagccta ggcaacatgg taaaaccccc gtttctacta aaaatacaaa aaattagctg 2880 

ggcatggcag catgtgcctg taatcccagc tactcgggag gctgaggcag gagagttgct 2940 

agaacccagg aggcagaggt tgcagtgagc cgagattgag ccactatgct ccagcttggg 3000 

cgacagagca agactccgtc tctaaaaaaa agaaaaaaaa ggccagatga ggtggctcat 3060 

gcctgtaatc ccagcacttt gggaggccga ggtgggtgga tcacgaggtc aggagatcga 3120 

gaccatcctg gctaacatgg tgaaactcca tctctactta aaatacaaaa aattagccgg 3180 

gcgtggtggc gggcacctgt agtcccagct acttgggagg ctgaggcagg agaatggcgt 3240 

gaacctggga ggcggagctt gcagtgagcc gagattgcgc cactgcactc catccagcct 3300 

gggcgacaga gttagactcc gtctcaaaaa aaaaaaaaaa aaaaaaatta gctgattagt 3360 

tgggcttggt ggcgggcgcc tgtaatccca actactcggg aggctgaggc gggagaatca 3420 

cttgaacccg ggaggcagag gttgcaatga gccgatatca cgccactaca ctccagcctg 3480 

ggcgacagag caagactcca tctcaaaaaa gaaaaaaaaa aagaaagggg ctgtgctgtg 3540 

gcctgggacc caaagcacac tactgcaagg tcccagggtg cctgactcca accggagcct 3600 

tgagaacatt catttgcaaa gaatgaatta aaattcagca ctattttatt ctgcaggatt 3660 

ccagcacccc aaggacagtc atttttagac ccttcagtaa cgtaataagt aaccggagga 3720 

tgtgctgagc ttccacttcc ccagacggtt gcctgtcaca gctcatcagg ccaacaaact 3780 

tttcttaggc ctcaaatttg gaaatgttca ctctcagttc gttccttaga tgcaagtcca 3840 

tcccaatgaa gtaacagggg ctcagcacct gtccaatctc attgcttccg gggacagggg 3900 

cccatgagga tgtcgtttca gcccggtgac acttgggcaa agtgcctttt ggtttccctc 3960 

ccaggctgga acgtgctggc tctgtgaagt tacgctgggc acaagagccc cccccaaccc 4020 

ggcaggactg actgctgtgg tcagaggcgc ccctggggct ttgggagcca cagaatcttc 4080 

ctgagggcag cgccggagga ggccccagtg agagtgccca ctgccaggct cattcctcag 4140 

gctgccgcag gcctctcccc aaaacaggca atgcttctca gcaacctgcc ccaggagcag 4200 

gccagggaag gccgccatcg gcctacagtg ctgggctctg gagggcttgg ttggtaacag 4260 

gccatggttt ctatgagcca gctggggtgt gaaggacaca ggctggattc acctctctgg 4320 

gcctcagttt ctgcattcaa aaagtgggaa tcatgatatc tgctctattt cttatctctc 4380 

agtgctgatg tgaacctcca ataagacttt taaaaatact ctttctacct tacttttatt 4440 

tttcatttat tttaagataa tgtctagctg tctcacccag gctggagtgc agtggtgtga 4500 

ttacggctca ctacagcctt aacctcccag gctcaagtga tcctcctacc acagcctccc 4560 

aagtagctgg aactacaggc atgcaccacc gcacctggat aattttttct tttgagacaa 4620 

ggtttcactc tgttgcccag gctggagtgc agtggtgcac tcttggctca ctgcagcctc 4680 

aacctccctg ggcttaggtg atcctcacac ttcagtctcc caagtagctg ggactacagg 4740 

tatgtgccag tacacccagc taatattttt gaaggatggg gtttcactat attgcccagg 4800 

ctggtcttga actccagggt ttaagcaatc taccttcctc agcctgccaa agtgctagga 4860 

ttataggtat gagccacccc ccggcctata atcctaccac tttaaaaaag cctgtaattt 4920 

tagcacttta aaaaattttt ctaaattttt tatagagatg ggggacagct gtggtctcac 4980 

tgtgttgccc aggctggtct tgaactccta ggatcaagcc atcctcctgg cctggcctcc 5040 

caaagtgttg ggattataag cataagcctt accttacctt ttttttttga gttgcagttt 5100 

tgttcttgtt gctcaggctg gagtgcaatg gcaagatctt ggctcactgc aacctccacc 5160 

tcccgggttc aagcaattct cctgcctcag cctcccgagt agctgggatt acaggcatgc 5220 

gccaccacac ccagctaatt ttgtattttt agtagagatg gggtttctct atatacctta 5280 



attttaaagc actgcattca tgtaaattgt gattaacatg gattcaagag agggagtgag 5340 

gatgaatgag ccaggcagtc acctcggctg tcaccctcca cttctctcct ccttctgaca 5400 

gtcatcgtcc atccgtttct gcagctgttt gtttgactct cctgatcatt ttgcttgcca 5460 

cataacttgc ctcctgggaa agaatgccct gggcaggccc acatgagtag tgaaaaataa 5520 

tctgcagtga aaaataaaac taagtagtct ggtccacaga gcagtcttat tttttcactg 5580 

cagatgaagg agttgacatt caggcttcat tctcatttat aagtgtttta aagacacata 5640 

cagtggattg aacagtggcc ttcaaaaaga tgtatctaca tcctaatccc tgggacctgt 5700 

gaatgttaac caagttagga aaagggtctt cccgggtgtc attaagttag agatcttgag 5760 

atgaggagct catcgtggat tatccaggtg gaccctgcat ccaaggacaa atggtcctta 5820 

gaaaagaaaa gcagaggctg ggcacagtgg ctcaagcctg taatcccagc actttgagag 5880 

gccgaggtgg gtggatcacc taaggtcatg agttcgagag cagcctggcc aacatgatga 5940 

aatcccatct ctactaaaaa tacaaaaatt agcaaggcat ggtggcgggt gcctataatc 6000 

ccagctactc aggaagctga ggcaggagaa tggcttgcac ctgggaggcg gaggttgcag 6060 

tgagccaaga tcgcgccact gcactccagc ctgagggaga aaagtgaaac tctgtctcat 6120 

aaaagaaaag aaaagcagac agagatctga gacagaagag gagagtgaag gaaaaaaggc 6180 

catgtgaaga tgaggcagag gttggagcca tgcagccaca agccaaggaa tacctggagc 6240 

cccagaagtt gcaagaggta ggaagaagcc tcccctagag cctccagacg gagcacagcc 6300 

ctgccaacac ctccacctca gacttctggc ctccagcact gtgagataat caactgctgt 6360 

tgttttaagc caccagattt gtggtaattt gttatggcag ccacaggaaa ctaatacagt 6420 

acctaatctt cacaaaccca tcttacagaa aaggaaactg aagtcagaga ggtagtggct 6480 

tgtgcagtgt gttaggccat tcttgtatta ctataaagaa atacctgagg ccgggcatgg 6540 

tggctcacgc ctgtaatccc agcactttgg gaggccaagg tgagtggatc acttgaggtc 6600 

aggagttcaa gaccagcctg gacaacatgg tgaaacccca tttctactga aaatatgaaa 6660 

attagccagg catggtggcg tgcatctgta gtcccagcta ctcaggaggc tgaggcagga 6720 

gaatcacttg cgcccgggag gaggaggttg tagtgagcca agattgtgcc actgcactcc 6780 

agcctgggag acaagagaga aaccctgtct caaaataaat aaaaaacaaa taaacacctg 6840 

agactgggta gtttataaag aaaggggtta actggctccc ggttctgcag gctgtacaag 6900 

catggtgccg gcatctgctt ggttgctggg aaggcttcag ggagttttac tcatcgtgga 6960 

aggcagagcc agagcaggtg catcacacag caaaagcagg agcgagagag agagagagca 7020 

gggaggtgtg cacactttta aatgagcaga tctcacgaga actcaccatt gcaaggacag 7080 

caccaagcca cgaggggtct gcccccatga cccaaacctc ccactaggcc ccacccccaa 7140 

cattgggaat tacagttcaa catgaggttt ggggggacaa atatccaaac tatatcattc 7200 

cacccctggc cccccagatc tcatgttctt ctcacattgc aaaatatagt catgccttcc 7260 

cagtagcccc ccaaagtctt aactcatccc agcattaact caaaaatccc attcccaagt 7320 

ccaacgtctc atctgaagat gagttccttt cacctacaag actgtaaaaa tgaaaacagt 7380 

tatttactgc tgagatacaa tgggggcata ggcattaggt aaacattcct gttccaaaag 7440 

ggagaaatcg gtcaaaagaa aggggctata ggccccaagc aagtccaaaa cccagcagag 7500 

caatcattca atcttaaagc tccaaaataa cctccttaaa ctccatgtcc catagccagg 7560 

gcacactggt gcaaggggca ggctcccaag gccttgggca gctctattcc tgcggctttg 7620 

cagaattcag tccccatggc tgctcttaca gattggagat gagggcctgc ggcttttcca 7680 

ggtgcagggt gcaagctgct ggtgatctac cattctgggg tgtggatggt ggcggccccg 7740 

tcccgcagct ccactaggca ttgtcccagt ggggactcta tgtggggcct ccaaccccac 7800 

atttcccctc caatgggaag gctctgcccc tgcagcagcc ttcttcctgg gctcccaggc 7860 

tttctcatac atcctctgac atctaggtgg atggtgtcaa gcttccttca ctcttgcact 7920 

ctgcacacct acaggcttaa caccacatgg aagctgccaa ggtgtatggc tggaaccctc 7980 

tgaagcagca gcctgagctg tgactatggc cctttgagcc aaggctggag ctggaacagt 8040 

ctagatgcag gcagggagca gtgtcctgag gctgtgcaga gcagcagggc cctgtgcctg 8100 

gacaatgaaa ccattctttc ctcctcatcc tctgggcctg tgatgggagg gttgtggaag 8160 

atctctgaaa tgcctttgag gcctttttgc ctctgaggcc tatttcctat tgtctcagtt 8220 

attggcagtc ggctcctttt tagttatgca aatcctctag caagaggtta ctccactgcc 8280 

ggcttgaact cctctcctga aaaagctttt tctttctttg tcacatggcc aggctgcaaa 8340 

ttttccaaac ttttatgctc tgttttacct ttaaatataa cttctaactt taattcattt 8400 

atttgctcct gcatttgagc atagggaatt caaagaagct gggccacatc ttgaatgctt 8460 

tgctgcttca aaatttatgg ccacgcttgg tggctcacac ctgtaatccc agcactttgg 8520 

gaggcctagg tgggcagatc acgagatcag gagatcgaga ccatcctggt caacatggtg 8580 

aaacccatct ctactaaaaa tacaaaaaaa ttagcttggt gtggtggcgc agacctgtag 8640 

tcccagctac tggagaggct gaggcaggag aattacttga acctgggagg cagaggttgc 8700 

agtgagccca gatcatgcca ctgcactcca gcctggtgac agaataagat ttgatctcga 8760 

aaggaaggaa ggaaggagga agggaagaaa tgtcttcccc ccagatgtcc tgggtcatcc 8820 

ctcttatgtt caaacttcaa cagatcccta gggcatgaaa ataatacagc caaattattt 8880 

gctaaggcat aacgaaagtg acctttgctc cagttcccaa taagttcctc atttccatct 8940 

gagactcatc accctggcct tggcttgtcc atatcactgt cagcattttg gtcacaatca 9000 

tttaaccagc taatcgggag gctgaggcaa gaggatcact tgaacccagg aggttgaggc 9060 

tgcagtgagc tgtgatcaca tcactgcagt ccagcttggg caacagagca agatcctgtc 9120 

tcaataaata aataaataaa tacataaata acttaagttt atttaaagct gcatctttgc 9180 

caccatggag aaaggccagg ccagctcctt ctctctttct gcacgtgttc ctcccacctc 9240 

agctgcctct gctcctcaag gaggaacaga gggagtagga aaggccatcc caggaggccc 9300 

agcaccccat gacctggctc tggggccttg tgggtttatg gattcccagt gctgagtcat 9360 

ccctcacagg ctcttgtggg caccttggac attggtcaga agcatgtggt ccccgggaac 9420 

acaccttttc ctgatcatct gggaagggca gcttgtgcca gcgaggccac ctgttcagcg 9480 

ccacggcccg ccagacagct gcagccacag ccttgccttt gatcagagca aacaccagac 9540 

atgtgtgtca tgcccccaac ccatctccag gggacacatg tcctttcttg ccaggcctga 9600 

gatgaacaag agagggacaa gtccccaagc ctctcfcctcc ttcctgcctc acccactccg 9660 

ctgttagatt ctcaaggtgg atggtgggct aactagggca accgaccatc ctggtttacc 9720 

tagaactgag ggggcatttt caggaataaa actgcaaaag tctggagcaa acaggagcaa 9780 

gttggtcact ctggggctgg tggagtcagg tttccttctg caggccccct ccccgcaagc 9840 

atgggtggaa cccaggacag gaacacagag caggccccag gaccgggctt gtcacttaca 9900 

agtctttttt tttttttttt ttttgagatg gagtcttgct ctgtcatcag ggctggagta 9960 

cagtggtgcc atcttagctc actgcaacct ctgccttctg ggttcaagtg atccccctgc 10020 

ctcagcctcc tgagtagctg ggactacagg tggcaccacc acgcccagct aattttttgt 10080 

atttctagta gagatgagat ggccaggctg gtcttgaact cctgacctca agtgatctgc 10140 

ccgccttggc ctcccaaagt gctgggatta caggtgtgag ccactgtgcc tggccccact 10200 

cacaagtctt aaaccatgcc tcagcacatc aatgccattt acaaaaaggt agagggattt 10260 

tccaggcaaa aatagatgaa agacatagga tgattgatca tgtcctgctt aaacataggt 10320 

ctgatgctat taagaattga gggctgggag cggtggctca cgcctgtaat cccagcactt 10380 

tgggaggccg aggcgggcgg atcacgaggt caggagatcg agaccatcct ggctaacacg 10440 

gtgaaacccc atctctacta aaaatacaaa aaatggccgc gcgcggtgac tcacgcctgt 10500 

aatcccagca ctttgggagg ccaaggcggg cggatcacga ggtcaggaga tcgagaccat 10560 

cctggctaac acagtgaagc cccgtctcta ctaaaaaata caaaaaaaat tagccaggca 10620 

tggtggcggg cgcctgtagt cccagcaact tgggaggctg aggcaggaga agaatggtgt 10680 

gaacctggga ggtggagctt ccagtgagcc gagatcacac cactgcactc cagcctgggc 10740 

gacagagtga aactccatct caaaaaaaaa ataaataaat aaataagaat tgttagtatt 10800 

ttgcaggtgt gacaaatgat tctgtttctg tggcagaatg ttctcaggag atctcttttg 10860 

aactctcatg gaaagcatca tgctgttggc aacatcacat ttatttttat ttatttatta 10920 

ttttttagag acagggtctt gctctgttgc ccaggctgga gtgcagtggc acaatcacag 10980 

ctcactgcag cctcaacctc ctgggctcaa gcaatcctcc tgcctcagcc tcccaaagta 11040 

gctgggacca caggcgtgag ccactgcact cagcccaatg taccttcaat atttacattt 11100 

ctggcaaagg tagcaaaacc ttaacaaatt ttgaatctag ataataaaat tatgaggctg 11160 

ggtgcagtgg ccctgacagg gatggctcac atctgtaatc tcaacatttt gggaggccaa 11220 

ggtaggcgga tcacctgagg ccaggagttt gagaccagcc tggccaacat ggtgtaaccc 11280 

tgtctctaac aaaaatacaa aaaaattagc cagacgtggt ggtgcacgtc tgtcatccca 11340 

gctactaggg aggctgaggc aggagaattg cttgaacccg agaggcagag gttgtgatga 11400 

gccgagatcg cgtcattgca ctccagcctg ggcaaaagca agagcgaaac tctctctcca 11460 

aaaaataaaa aaaaaataaa ttaatgaatt aattaaaata aaataaaata atggatagtc 11520 

actgtaaaga aaaaataaat gtatatatca gccaacaagt gatggaatag agcaccccat 11580 

ctccctggct ggacagatac atcccacaac acctggaagg cggctccatg tagaactttc 11640 

tggactgctt gaggtgctgt gctggagcac ggtgacagag gagctggacc atggacctcc 11700 

cccggccccc accaagggcg aggtccccct gtggtgggtc tgagggaggc atccgtatgg 11760 

cctctgcggc ttgggcaggg aatttggggt ccaagtactt ggtgcaaagc ctggaaagag 11820 

ggtttgggtg ctgagggcat atcccctggg ccacatgggg gcagaagtgg ggccccctga 11880 

agcttggagt cctgggcagg ggcatctatt ttgctgtctg aggccttcag tacttgaagc 11940 

aaaatggagg cagaatgtcc caccttaatg cccctgattc ctccaaacca attccagaga 12000 

cagcaagggc cagaacaggg atggccctgc ccagggtcat gcancgagga agtggccagg 12060 

ctgggatctg aacccaggct aatcccctcc cttgtcctcc tccaggccct cacccctgca 12120 

tagagccctc cagctcactc atcctcggcc agctccatct cctcagcttg taaacccccc 12180 

cgggattttc ctttcttaaa aaacaaaggc ttggccaggc acggtggctc acgcctgtac 12240 

tttgggggtg gctcccagca ctttgggagg ccaaggtggg cggatcatga ggtcaagaga 12300 

ttgagaccat tctggccagc atggtgaaac cctgtattta ctaaaaaaaa aaaaattaac 12360 

tgggcatggt ggctagctac ttaggaggct gaggcaggag aatcgcttga acctgggaga 12420 

aagaggttgc agtgagccaa gatcgcgcca ctccacttta acctggcaac agaacaagat 12480 

tccgtttcna aaaacaaaca aacaaacaaa taaacaaaaa aaggcggagc gcgatggctc 12540 

gcgcctgcaa tcccagcact ttgggaggct gaggcgggcg gatcacttga ggttaggagt 12600 

ttgagaccag cttggccaac atggtgaaac cccatttcca ctaaaagtac aaaaatcagc 12660 

caggtgtggt ggtgggtgcc tgtaatccca gctactcagg aggctgaggc aggagaatcg 12720 

cttgaaccca tgacctggag gctacagtga gctgagattg cgccactgta ctccagcttg 12780 

ggcaacaaga tttgtttctc taaaaaaaaa aaaaaaaaga ctggcccttc cccttcagct 12840 

cttcctcagg gtccctgagc actctacacc cccgtctaca ctgagcactc caccctgctg 12900 

tctacactga gcactccacc ctgccatcta cactgaggac tccaccccac tgtctacact 12960 

ggctgcctcc cgccctcacc tcctgctaag gccattcccc gctgcatctg tcttctagat 13020 

tctgcagcct tcagcacgct gggcccctcc tttgtcccct tgagccacct ccagcctccc 13080 

cctgagctgc tactcctctc ccagcagcct ccacccaagc ccctccagtc cccaagctgt 13140 

cccttgcatc cagcactgcc cttccacgtg ccccttccct ccagcttcac agcagggtgg 13200 

ggcctccagg ccctgcccac tgtgcccatc cacaagttgt ggtgggagct ccgaggggag 13260 

gcaggggtgt gcatggactt gggacgtcca agtctgggac caggggcagc tggttggtgg 13320 

agtgtggagg gggataggga ctttcaggta gagaggctgt aggggcaaga tcgggacggc 13380 

ggatgtccct aaggagggct ctgacctggg aaatattgtg cagcttcctc tttgccattc 13440 

ctggagctca gacactggcc ggctctcacc ccgcccttcc tgcaggacac agctccatcc 13500 

cagtgagttc ctagtgtaga catctccagc agcacggatg ggaaaggaag tcatcaaagg 13560 

tgcccaggac cggaggcttt ttctggaggt ggcagaggag ggtgtgggtc tcagggctct 13620 

ggctgagggc aagcgtggga ggtcttaggt ctgcaccagc cccgtgaagg cccctcctgc 13680 

tccctggtgg agtcctagag ggaacagcag cccctaggct ctagcaggag tgggtagggg 13740 

cttttctggc ttcctactgt gccagcagga tagctgggcc tggcactgag cccaaagatc 13800 

acatgccggg gcattggcgc agtgaggaac agacccttgc caaagctggc aaagaagacc 13860 

ccatggggtg cagctggtga agctgagagc tcaatgtttg ggggagcctg gcaaaagggg 13920 

tcctcccctc cctctgcagg ccaggatcgc aggttttccc tacatgttgg taattctcaa 13980 

acaatcccat ggccactgga gcaaagatca cagtgggcgg cggcctcggg agcagtggac 14040 

agggcacgca gtgcctttga tgccagagcc ctcgccccaa agtcaacaaa ctctgcagcg 14100 

gactttgcac ccggactttg ttttcaccat acaaggaaag ggacagatca caggccctct 14160 

cgctgccctc gctgagccgg aagctgcagc gtgagctctc tcaagcccca tttctaggtt 14220 

ccccaggcgc acccctgagc ccctactcgc ctattaagtt ctcctaatag cccttcaagg 14280 

tcttaatgta tgtccattag acagagggga aaactgaggc gagggcaagt gacttgaccg 14340 

aggttcctcg gcgagcaggg cgtggagctg agaacctcgt tattactgct ccccacacaa 14400 

ccctctggcc gttcttggaa gaaggctgag ccccgggggg gccagagtga cccaaacacc 14460 

atgggccgcc tgcggtaaca cgtgcggcca cgaaggggca gcagtttccc gcccggccgg 14520 

gctctctccg gcgctcagta tccgtcccag gccaagaaga agaaactcgg ggaggagggc 14580 

ggagggggct gcgtgggagg gcgtggaaga tggacgtggc caggggagtg gcagctgcac 14640 

acagtggatg ctgttaagat gaagggaaag aacgtgggct ccgagatcac tggacacggt 14700 

tccacctttc ttcccgctca ctgcatggcc ctgggcgggt tgttgaaccc ttggaaacct 14760 

gtttttcctt ttttcctttt tttttgagac agggtcttgc tctgtggccc agactggagt 14820 

gccgtggcac gatcttggct cactgctgcc tcccaggttc aagtgatcct cccagctcag 14880 

cctcctgcgt agctgggacc ccaggtatgt gtcaccacag ccggctaatt tttgtatttt 14940 

tttgtagaga cgggatttcg ccgtattgcc caggctggtc tcaaactcct gagttcaccg 15000 

gatcttcctg cctcagcctc ccaaagtgct gggattactg gcatgagcca ccgcacccag 15060 

cagagacctc agttttctaa cctgtgccag caggaataat gatagctgcc tagcttggct 15120 

gtgctgggaa ttaagtaaga tgaccgggta gcaaatatga agtattactg gacacagagg 15180 

gccccaggct gggttagcag cggtggtcag ggctgctgct tcctggcctg agctcgaagg 15240 

agggccctca ttaccacctg ggtgagtcct cgtccaagcc tggcactgct gcgtgggaat 15300 

aacttctgcc acccaagttg gcagattgtg tgcaaagtta agtcctgact ctgtggggtg 15360 

gacttcgagg cctcttcatc ggacctgctt ccggtgactg cattcgcacc tcctcctgtt 15420 

cctggtttaa cacagcccag ctttcctcct gctgagccct ccctgggcct gctgtcaccc 15480 

tcgtgccgct gtgcctcgca gtgccactcc ctgtaccctg aatactttgc cctgcctctc 15540 

cacccagctg agagtcaggg cccctgtgag gctctgccca gcccgtcctc cgggtttctg 15600 

cctctgctga gcacttccct gcatgattgc ttctgagagt ccccccagcc tgtgagcttc 15660 

tcaggactgg gacagcttct caggaccgag gcttcctggt ctgcttgcaa ttttacaggc 15720 

gggcacattt tcccttggcc aacatcagag actggacatc tgcagatctg tgctagccac 15780 

tgagcaccca ggcaccccag caggtagctc tgtaaccaac ccattctgta aagctgaggc 15840 

tcagagaggt gaagcgcctg gcctggggcc acagcctgcg tcagctgcag agccaggagc 15900 

tgagatatgc acctgcggct ctgctcacag ggtcctgcac agactgctgc tggagccacc 15960 

tatgtagagt caagagagtt catgttaact ccctctcaca tccctcagcc agggtggggg 16020

ctgacgatag acactcaggg atggcctacc ctccccaaca acccccgtca ggtttgccgg 16080 

atctccttgg aagaaaagtt ctgggcagaa ttccaccgtt ggcctggcct acactctcct 16140 

tagtggctta ggaccctcag cggtggataa gttgtgggca gaagagatgc aatcaggatt 16200

ctcacccact caccccttgc cagccccaat aagctcaata agctgggctc ggtctgagga 16260

agtgtccagg aaatgtgcaa atggcctggg acagccctgt gttcctttca gtaaggttgc 16320

tgaaggtgag gctgaaagtt ggagaaacag aagccagtgc ttatggtttt aattaagata 16380 

atggaatgta tgtatgtatg tatgtatgta tgtatgtatt tatgtattta tctttagaga 16440

tagagtctca ctctgttgcc caggctggaa tgcggtgaca caatcatagc tccttgcagc 16500 

ctcgacttcc tatgcccaaa tgatcctcct acctcagcct cctgagtagc tgggactaca 16560 

gacacacgcc aactatgcct agctaatttt tatttctatt ttttgtggag actgggttct 16620 

cactttgttg cccaggctgg tcttgaaccc ctagcttcaa gcaatcctcc tgcctcagcc 16680 

tcccaaagtg gagggattac aggtgtgagc caccacacct ggcctggaat ttatttgtat 16740

tctgcttata aaattaatac attcttattg cagaaaagtt tgaaaataaa agaaaggaca 16800 

aagaacaaaa agcgtatata atttcacagc tcagatctca ctgctattaa catttttatt 16860 

tactttcagg cttttttctt tctaggtaca tatgcagaga ttattttatt ttatttattt 16920 

tattttatat tttattttat attttttatt tcattatttt attttatttt attttattat 16980 

ttttagagac agggcctcac tctgtcaccc aggctggagt acaatggagt gatcatagct 17040

cactgcagcc tcaaacacct gggctcaagc aatcccccca ctcagccttc tgagtagttg 17100 

ggactaaagt gtgagtctgg ctaatttttt ttactttttg tattgacaga ggtctcacta 17160 

tgttgcccag gctgatctca aactcctggg ttcaagcgat cctcccacct tggactccca 17220

aagtgctggg attacaggca tgagccacca tgcctggcct aaaatgccac tttttgtcat 17280

ttactaaaat cccatggaca ctttgacatg tctgtattct atgctattga tctgactgtt 17340 

ggcatctaca tcattatggc catctatcat ctatcataat ccattttaac attaaaattg 17400 

tgctgctgct tagatttttc tggcctgtct cctatttgta ttcttccaga taaattttag 17460 

aatcatttta tcaaattccc cttgcagaaa aagccctatt ggattttggt tgaaaaatac 17520

tgaattttta cattaactta ggaaagggct gggcacggtg gctcacgcct gtaatcccta 17580 

cacttttcga ggccaaggca ggtggatcac ttgaggttgg gagtttgaga ccagcctggc 17640

caacatggtg aaactcggtc tttactaaaa atacaaaaat tgccaggcgc attggctcac 17700 

ctgtaatccc agcactttgg gaggccgagg tgggtggatc acgaggtcag gagatagaga 17760 

ccatcctggc taacacggtg caaccccgtc tctcctaaaa atacaaaaaa ttagccaggc 17820

gtggtggtgg gcgcctgtgg tctcagctac ttaggaggct gaggcaggag aatggtgtga 17880

acccaggagg cggagcttgc agtgagccaa gatcgcgcca ctgcactcca gcctgggcga 17940 

cagagtgaga ctccatctca aaaaaaaata ataataataa tacaaaaatt agccgggggt 18000 

cgtggcgtgc acctataatc ccagttactt gggaggctga ggcaggagaa tcgcttgaat 18060 

ccaggaggtg gaggttgcaa tgagcagaga tcgtgccact gtactccagc ctgggtgaca 18120

gagtgacact ctgtgaaaaa aaaaaaaaaa ttctgaagga ttgagactct tagactctta 18180 

ggtcttccta tccaagagca caatatagct tttcatgtat tcaagccttt ttcaatgcat 18240 

caacagaatt ttacagtttt tttcatgata tcctgctatt tcttataaaa tgtattccta 18300

gatattctgc atgttttccg gttgtttgtt aataaatatt tttcatttgt cattatttcc 18360

taattggctg ttatttgtat atatgacatc tgttgaattt tttgattact ttgaaaatgg 18420

ccattctttt gtgttttttt ttaactttct attttgagat aattttgact tacagaagat 18480

ttgcaaaaat agtacagaga gttcctgttt cccccttatg ttaacccagt ttctccttat 18540

gttaacatct tacataacta cagaacaatt gtcaaatcta agaatcaacc tgggcacaat 18600

gctattaact aaactgcaga agctgttcag atctcaccag ttcttctact gctccccttt 18660

tctcttccag tgttcaatcc ggaatcctac attatattta gttgtcattt ctctttggtg 18720

tcttccaatc tgtgacagtt cctcagtctt tctttgtctt tcatgacttt cattttttta 18780

tacttttgaa aaatactggc cggttgtttt gtagaacgcc ctcagtttgg gtttgcctga 18840

agttttttgt gattagatcg aggtcatgca ttattggaga gggtgccacc gcctcgatgt 18900

gcaagctcaa tgcatcatat cagagggttt gtaatgtcag tttataccgc cggagaccct 18960

aacctggagc atttcgtgaa ggtgctgtct gccaggattc tccactagaa agttactatt 19020

tttccctttt taattactga atgtctgagg ggaaatactt tgagactatg caaatatcct 19080

gtttctgctt taacttcggc tcactaagtt tagcattcat ctatggatct cgcttatagc 19140

aagtattact gtggagttct aatggtaatt ttctgtttct ctcattcctt caacctttat 19200

taatatgctt cttcctcact tattcatttt gtttcagttg tttataccaa catggatttg 19260

tggatattgg ttttattctt tgggttgcaa ttgaatccta tcattatttt gttagtcagt 19320

tgttccatcc gaccttggtc attaggagcc cttgaaattt ggctcccatg cctttttttt 19380

tttttttgag accgagtctc actctgtcac ccaggtttga gtgcagtggc atgatcttgg 19440

cttcctgcaa cctccgcctc ccaggttcaa gcaattctcc tgcctcagcc tcctgagtag 19500

ctggtattat aggcgctcca ccaccttgcc cggctaattt tttgtatttt tagtagagat 19560

ggggttttat tatgttggcc aggctggtct caaactcctg acctcaggtg atctgcccgc 19620

ctcggcctcc caaagtgctg ggactacagg cgtgagccac cacacctggc ctcctatgcc 19680

attttaacat gcccgtcttt tctttttctt tcctactttc tgtgactgta agaagctcca 19740

ggatacattt ttgctgccct agacttagcc tcaatcagtt ctcagaaaag ctctggttct 19800

ttttatggga tacttagaaa actagctctg tatggcctgg cgcggtggct cacgcctgta 19860

atcccagtac tttgggaggc cgaggtgggc agatcacaga tcacgaagtc aggagatcaa 19920

gaccatcctg gctaacatgg tgaaactctg tctctactaa acatacaaaa aattagtcca 19980

ggcgcggtgg cgggcgcctg tagtcccagc tactcaggag gctgaggcag gagaacggca 20040

tgaacccggg aggcggagct tgcagtgagc cgagatcggc agccactgca ctccagcctg 20100

ggccacagag cgagactccg tctcaaaaaa aaaaaaagga aaaagaaaaa agaaaactag 20160

ctctgtatgc tagttttttt tttaagacag ggtctctctt gccccagctg gagtgtagca 20220

gcacgatcac agctcactgt agcctcaacc ttctgggctc aagcaatcct cctgcctcag 20280

tctcctaagt agctgggtct acaggcatgc accaccgtac gtggcaattt ttaaaaactg 20340

tttgtagaga tggagtctcc ctatgttgcc tggtctggaa ctcctggcct caagtgatcc 20400

tcctgcctcg gcctcccaaa gtgctgagat tacaggcatg agccactgta cctggcctgg 20460

ccaaggtctg tcttttttta aaagaagttg ttgtatagtt gttttttttt ttattttttt 20520

ttctgagacg gagtctcgct ctgtcgccca ggctggagtg cagtggtgcg atctcggctc 20580

actgcaagct ccgcctccca ggttcacgcc attctcctgc ctcagcctcc cgagtagctg 20640

ggcctacagg cgcccgctac cacgcccggc taattttttg catttttagt agagacgggg 20700

tttcaccgtg ttagccagga tggtctcgat ctcctgacct cgtgatccgc ccgcctcggc 20760

ctcccaaagt gctgggatta caggcgtgag ccaccgcgcc cggcctgttg tatagttttt 20820

atctcgagtt ttctagcgat ttaatcatat tggttacaaa aaaggatgat tttactacct 20880

cctttccaat gtttctacat attttttcat tttatctaac tgcattttaa aataaacttt 20940

taattttaga atggtttcat atttacagaa aatgtgcaaa gatagtacag agagttcctg 21000

tgtactccac acccggtttc cttattatta tcttaacgtg atacacaatt aataaaccag 21060

taacattatt attcactgaa gtccacactt tctttttttt tttttctgag acggagtcta 21120

cttctgtcac ccaggctgga gtgcagtggc gcaatctcgg ctcactgcaa cctccacctc 21180

ctgggttcag gcaattctgt ggctcagcat cccaagtagc tgggaataca ggtgcccgcc 21240

accacgcccg gctaattttt tgtattttta gtagagatgg ggtttcacca tgttagccag 21300

gatggtcttg aactcctgac ctcgtgatct gcctgcctca gcctcccaaa gtgctgggat 21360

tacaggcgtg agccaccgcg cccggcgtcc atactttctt tagatatcct tcctttttac 21420

ctaacgtcct tcttctggtt caggatccca tccagaaagc aacattaccc ctcgccatca 21480

cgtcttcaca ggctcccctt gacgggaaga gttcctcaga ctttccttgt ttttgttgac 21540

cttgacagtt ttgaggagga ctggtatctt agtctgtttt gtgctgctat cacagactag 21600

ctgagaccga tacatgatac atgaaaaaaa atgtattctt acagttgtgg aggctgggaa 21660 

gttcaagacg aagttgctgg ttggtttggt ctctggtttc aagatggcgc cttgctgctg 21720

catcctctgg agaagaagaa tgcggtgtcc tctcactgca gaagatggaa gcgctaaaag 21780

gaatgaactc cctttgccaa gccattttat aatgggcatt aatccacaaa ggatgaaacc 21840 

ctgagaaaca tcaagcttta aagcactggt tctcaacctt tttggtctca ggagcccttt 21900

atactcttaa aacgttttga ggatcccaaa aaaaggcttc tacaggttcc atcttttaat 21960 

atttaccata tcaaaaatta aactgaaaaa attttaaatt atttattcat ttaaaataac 22020 

aaggataaac ccattacatg ctaacataaa tcatgtattt tatgaaaaat agctatattt 22080 

atcaaaacaa aaattagtga gaagagtggc atgtataatt ttttttgttt attttttgtt 22140 

tttagatgga atcttattct gtcgcccagg ctggagtgca gtggtgtgat ctcggctcac 22200 

tgcaagctct gcctcccagg ttcacaccat tctcctgcct cagcctcctg agtagctggg 22260 

actgcaggtg cctgccacca cgcccggcta attttttgta tttttagtag agatggagtt 22320 

tcaccgtgtt agccaggatg gtcttgatct cctgaccttg tgatccaccc gcctcagcct 22380 

cccaaagtgc tgggattaca ggcttgagcc actgcgtctg gcctaaattt ttgtgaatgt 22440 

ctttaatgcc tgccttctca tatttgtttc tgcattcaag ttattgcaaa atgttgtgtt 22500 

ggttgaagtt tgtaaagaaa atgtggcctc atacagttgt gtagttggaa aggcaagagt 22560 

attttgattc tctcttcaaa caactatgga caacctgctg ttacaaaacc agaatgcaaa 22620 

aagttgtagt aaatacaggt taggtgtagt gtggaatctg aaagcatgtg aatgaacttt 22680 

ctgagttttg taacattaaa gtccagttgc gttaagctac tgtgatagca tatagcattg 22740 

tcctaatact ggaattagta tcagaagtgg ggtgctactg ttaataaata aaaagaaata 22800 

aataaatcat gtgatactgg ctcagaagtc aggcagtagg ctgtgtggaa cctgacatca 22860

cgccatgtaa tacattggca accatttgat ccagctgtct gtcatgatga cttggaaagt 22920 

caaccacata cttacagagc ctgtagacat aggggaaaat agtataaaac agaatactaa 22980 

cagtggacct tggttcttgc cagttgcatt tagccaaata ttaaacaaaa gagatattct 23040

tgggcagcaa ctggaccatc ttcaagtaaa agtgaaaggt aataaacaga gtccagacat 23100 

ttgtgcccat gcgggttaag aaaaatccag ttgcttctag acaccgtata tgaaaacaac 23160 

gctgaaaaca agcctttgag tggtaaaggc cgattaacac tcagcgcggt aacaaagacc 23220 

aggtgggcta acccgaaatg aaatgagaag cctgtggtga tgaggaggca gagaagtaaa 23280

atcaagtttg agcatttcgt ttaggagagt ttgggctctg attacttgca catgcaaacg 23340 

aactggaaac aaacagatca gatgtctacc acttcttcga gggaattgca ttgccaaaga 23400 

agtcatgaaa gcagactcta tactgattag gcattaaaac aaaaacaatc tttaggcccc 23460 

taaacttgca tgggcaggaa gtgggctgtc aaagctgttc atcctctaag gtggacctag 23520 

ttcctagtcc ccagtataca cttcagatgt ggccctggag gacactggac atggaggacc 23580 

tcccagagga tgaggctagg gcttcatttc tccaatgacc tcagctgcct ctatttcccc 23640

ttcttcctct ggaagtccta tcatcgttat tattattatt atcatcattt ttattttgag 23700 

ataaggtctc gctctgttgc ccaggctgga gtgcagtgac atgatcatgg ctcactgcag 23760 

ccctcccagg ctcaagtgat cctcctgcct cagcctcctg agtagctggg agtacaggca 23820 

catgccacca tgcttggcta tttttttttt cagtagagat agggctctca ctatgttgcc 23880 

agggctgatc tcaacctcct gggttcaaga gatcctccta cctcagctcc tgagtagctg 23940 

ggattcgggt gcacaccacc atgccaacta atttttaatt tttttttgta tggacaggat 24000 

gtacagtgtt agaaatggat tgcttgcaga ggcaggagga tcacttgagc ccaggagttt 24060 

gatcacactg tgaaccatga tcgcacccct gcactccaat ctgggcaaca gagtgagacc 24120 

ttgtctcaaa aaaaaaaaaa aagagagaga gagagagact caaagatagg caaaaaagtg 24180 

ggaaagcttt atagtggaca aaaaggaacg ctctaagtct gccctattgg catggtgctg 24240 

aaggtgggct aactagagat agggggtact atgtggttga ctatgggtgc atctttggct 24300 

ttccctgggt gatcctaagt tggaagcagg gacaaaaatt agggaagctg ttagttattc 24360 

atcacgttct ggcagtagtg gactggttgt gatagaagtt attgttttgg ccaggtgcgg 24420 

tggctcatgc ctgtaatcct agccctttca gagttcaacg tgggtggatc aggaaggagg 24480 

gaggatttgg gaggtcagga gttagcctgg ctaacctggc gaaatcccat ctctactaaa 24540

aatacaaaaa ttagctgggc gtggtggtgc atgcctataa tcccagctac tcgggacgct 24600 

gaggcaggag aatcagttga acctggggag gcggaggttg cagtgagcca agatcgtgcc 24660 

caatttcatc tcaaaaaaaa aaaaaaagtt atcgtttagc ttcctcgatt gttactggac 24720

gtagtaatct ggcttcctgc aagtctaact ttcagcagac tggctacatg ggctgtgtac 24780

tgtagataag gcagtaagta aagcaaaaat tgatagagca tcaaggataa atagaaaatc 24840

cgtaatcaag cagaagattt gaacacttca ctttcagtaa ctgataaaac aagtagacaa 24900

aaaaaatcag taaggatgta gaagatttga acaacgtaat taacaaactt gacttgattt 24960

acacgtctag aaccctgcag aacacacact ttttcaagca tactcagaac atttatataa 25020

agtgaccata tggtggacca taaagcagtt tcaacaaatc tcacaggagt aaaataacag 25080

accgtgtttt ctgaccgtaa gtacagttaa cctagaaatt gaaaacaaaa agctagaaaa 25140

accccatgta tctggaaatt ttaatataca ctttgaaata acaaatggat cagagattaa 25200

ttcaaatagg aatttagaaa taccttgaac tgaaaaataa tgagaatact ataccccaaa 25260

actgtggggt gcagctgaac agtatataga cgaaaagtat actcatatgt gcatacctta 25320

aggagcgggg aggattgaaa gttaatggga ggcaaaagca ggtggatcac ttgaggttag 25380

gagttcaaga tcagcctggc taacagggtg aaaccccatc tctactaaaa atacaaaaaa 25440

ttatccaggc gtagtgaggc tgaggcaaga gaatcgttgg aacccaggag gcagaggttg 25500

cagtgagccg cgattgcgcc actgcacccc agcctgggag acagagcgag actccatctc 25560

aagaaagaaa aaaaaaaaag aaaaggccag gcgcggtggc tcatgcctgt aatcccagca 25620

ttttgggagg ccgaggtggg cggatcacga ggtcaggaga tcgagactat cctggctagc 25680

acggtgaaac cccgcctcta ctaaaaatac aaaaaaatta gccaggcgtg gtggcgggtg 25740

cctgtagtcc cagctactca ggaggctgag gcaggagaat gtcatgaacc caggaggcag 25800

agcttgcagt gagccgagat cgcgccactg tactccagcc tgggcaacag agagagactc 25860

tgtctcaaaa aaaaaaaaaa gttaatggga taaacatcca tctcaagaag ttagaaagga 25920

atgacaaata aaccaaaaaa aaaaaaatca aaagaagaaa atcataaggt caagactata 25980

aagagagtgg ctgggtgcag tggctcaggc ctgtaatctc agcattttgg gaagcagagg 26040

tgggcagatc acttgagccc aggagttcaa gaccagcctg agtaacatag agagacctca 26100

tctttgctga aaataaaaat aaaaaattag ccaggcatgg tggtactgag gtgggaggat 26160

cacttgagcc taggaggttg aggctgcagt aagccatgat tgtgccactg cacttcagcc 26220

tgggtgacag agtgggaccc tgtctctaaa aaactaaaat aaggctgggc gcggtggctc 26280

aaatctgtaa tcccaccact ttgggaggcc aaggctgagg tcagcagttt gagaacagct 26340

tggccaacaa gatgaaacct catctctact aaaaatacaa aaaattagtt gggtgtggtg 26400

gcatgtgcct gtaatcccag ctacttagga ggnnimctnt ngattatatt ttctccttcc 26460

tacgtcgtta ttggactgaa ttcagaatga tgactctcat tggagctctt cctgtctcct 26520

aactacagtg gcttccgacc ccactctggt tttcacttca cccctctgct gctcatacga 26580

gtagatactt ccttccttct ttctcacttg ttgctcttcc tcaacccccc ccgttggtgt 26640

cccctcctct ttatcttttt ctcgcgacac ctgcgttctc ttgccctctt atcatccctt 26700

tctcgaggcg gtcctttcct ttatccagct taaatacctt ctcctctgtt tatttggggg 26760

ttgggttttt atctctcacc ctccctctaa tttctttcct ctttccgcac ccatcaagcc 26820

tctcgtggtt tctcttcctc tactctcggg tcccccccct ctccccttct ttttttcttc 26880

acccccccaa gcgctttgcc ttttttttct ttgcccttta ttcccccc 26928



<210> 7 

<211> 29430 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 

<222> (4336), (4345), (4349), (4392), (4447), (4490)

<223> The identity of the nucleotides at the above locations is unknown.



<400> 7 

aggggaaggg ccggctccgt agctcacacc tataatccca gcactttccg aggagagagg 60 

atcatctcag gccaggagtt caagaccagc ctgggcaaca cagcaagacc gcatctctac 120

aaaaacttct tttaaagctt aaaaaaaaaa aaaaaagcaa agaggacagt tcaggagaaa 180 

agcctgtaga ggcagcacac taaggaggag acgcagccca ggcaccagga ggggctggcc 240

atgggcactc actcctccag caggcgagtg cccagcacca gctggcccac ccagacaccc 300



-30- 



WO 02/092000 



PCT/US02/14877 



aggacacggc ctgaatggct ccgtattcac gtgggtggta ataaacaagc aatacacata 360

gccaataagg acaccttagt aatgttacat cataaacgct gcagatcagg gaaatggtgc 420 

agggtgaagt gggttggggg gctgcatgct acatgagaag tgggtcgggg ggctgcatgc 480 

tacctgagac agagcaggcc ttgctgggaa agaaggagcc ggcaggcctg ggcaaaggtc 540 

ctggggtggg agcacactgg agcagagtgt gggggtagca tggcgggtgc tggtcctctg 600

ggcgccttcc caccacgtca tgtgcccatg tgcccaaggt ctctcgtttc acagccccct 660 

gaagctcagg ggtcacagct acacagcccc cagatacctt ggcctgcccc aggtcattcc 720 

atccagtgat ggacctgctg acctctagcc tgacctctgg gcagcgtaat ttgagaagga 780 

ggagaaggga gggcaacaga cctggggcga tgagggatgc acagggtggc agacacctga 840 

ggctgcacct tggagcctca gttctgggtg tgggtggggg atggacaggc tgagggctga 900 

agcagctggg cccggccacc atcacacccc aggacccacc agatcaccat gaaaaaccga 960 

atgtcaactg gcagcccaga gtgcagaaca aacctttcag aaacacggtg gtgactgccg 1020

catcatgaac ataaaataat tacgccctct ccccagggat cacccctgca ggagtttgtc 1080 

ccaagaaaca ccagaaagaa ggaaaacgtc tgagtcacaa tatttgctga ggccttattt 1140 

gtaatagcaa aaaaaaaaaa aaaaaaagaa caatctccag cggcaggggt aactagacta 1200 

ttgtctccgt ggaaaggtag caccaattaa ctagtaacaa aatgactgcg gtaacaacaa 1260 

aacgttcgac atgtcaacac caaaaaccac acacccagca taaccgtgaa ccatgatttc 1320 

tactagaatg aatggcagtt atgagaaagc accagcggag acaaagattg aaaaagtaaa 1380 

ggtggcctca ttagggagac aagtctctgg gtaatatatt gtaatactgg taaatatata 1440 

gtttttaata tattttttaa ttccaaattc catatatgtt cctatgaagc tatttctgca 1500 

aatatttttt tcaggaccgt acatcacaaa ggcaaaaggg ccaggtcagc tctccagctg 1560 

agagtgacca cttcagagca gacggcagac tccagggtta gcaagcctgg ctgagacctg 1620

gcccatgaca atcactcaac ccctctgacc tcaacatcct gtctgtgaaa tggggataat 1680 

tactgcacct ccacatcaca gagtgcgagg cttaaacagg atgcttcata gaaaagcgct 1740 

caagaggtaa cagccgggag ggggtagtgg ttttcattaa ttaaatgttg ccttcatcca 1800 

gccctgggcc agctccaaca caaagcacac accatccact cagactcagt tgcctggatt 1860 

caaagcccgg cctggcctcc agctgtgaga ttccgggcag gatttcccat ctcccagagc 1920 

ctcagtttcc tcattcatga aacaggaagt gatcattcct tttattttta tttttatttt 1980 

tattttgaga cggagtttca ctctagttgc ccaggctgga gtatgatggc gcaatctcag 2040 

ctcactgcaa cctcggcctc ccagtttcaa gcgattctcc cacctcagtc tcctgagtag 2100 

ctgggattac aggcacacgc caccacgccc agctaatttt gtatttttag tagagacggg 2160 

gttttgccat gttggtcagg ctggtctcga actcctgacc tcaggtgatc cgcccgcctt 2220 

ggcatcccaa agtgctggga ttacaggtgt gagccaccaa gcccagttga caactgcttt 2280 

taaagacacc tctggctgct gtggaaaaca gcctggtagt gcctcaaaaa gttacacata 2340 

gaatgatcct atgaccagta attccactcc tacatatata cccaaaagaa ctgaacccct 2400 

ctactcatgt atgtacacat acaggtacac gcatgttaac agcagtgttc acaaagccaa 2460 

aacatggaaa cagctcaaat gtccataacc gatgaacgga taaatgaaac gtagtctatt 2520

caccacctga cggaggtgag aggggccata aaaaggaatg atgcataaaa acgaatatta 2580 

tggccaggta tggtggctca cgcctgtaat cccaggactt tgggaggctg aggcgggcgg 2640

atcacgaggt aaggagttcg agaccagcct ggccaacacg gtgaaacccc atctctacta 2700 

aaaatacaca aattagctgg gcatggtgga gggcgcctgt aataccagct actccggagg 2760 

ctgaggcaag agaatccctt gaacctggga aacagaggtt gcagtgagct gagattgcac 2820

cactgcactc cagcctgggc gacagaccaa aactccgttt cggaaaaaaa agaaaaaatt 2880 

agccaggtgt ggtggcgggt gggtccctgt aatcccagct ctacttggga tactgaggca 2940 

ggagaaccac ttgaacccgg gaggtggagg tagcggtgag ctgagattgt gccactgcgc 3000 

tccagcctgt gtgacagaag gagactctgt ctctaaaaaa caaaaacaaa aaaggcccga 3060

cgcggtgtct tacacctgta atgccaacac tttgggaagc caaggcaggc agatcatctg 3120 

aggtcaggag tttgagagca gcctgggcaa cacggtgaaa ccccatctct actaaaaata 3180 

cagaaattag ccaggtgtgg tggcacatgc ctgtaatccc agctactcgg gaggctgagg 3240 

caggagaatc gcttgaaccc aggaagcgga ggttgcagtg agccgacatt gcaccattat 3300 

actccagcct gggtgacaga gtgagattct gtctcaaaaa aaaaaaaaaa aaaaaaaaaa 3360 

ctaaacaaaa gcaaaaaaac caatgagtaa tgttgtcaag tgaacttcat cccaatggga 3420 

atgcagataa tttgtttaaa aggcaccatg cacactgggc aggctggctt cccctgggaa 3480 

cgtcttcttt tgcctggatt cccagttggt ttaatcgggc gtagaacact ttcttcaatc 3540 

cgggattcag gcacccctgc tcagcacaaa ctcagtacac cccgcactct gctgtgggtt 3600

cttggcacta ttaggagaat gtgagggggt gattcagatc tatctctagt gggtgcatgt 3660

ctgccactcc caggaacgcc cacttctggc aagtcagtgt cagagaaagg ccagctcgtg 3720

gcccctcctg ccttgagtcc caggacccgt gatcagtcct acccggagca gaatcaggag 3780 

tttgaaaacc caagtgccaa caatctcatt ttaacccatg taagcatatc caatatttat 3840

atatagaatt cataacagat gtctgggctt ccattccaat agcctatatt ttacactgtt 3900

tatttacatg gttacaccaa acaagactca attcaaggta acccaatcct ttgctactat 3960

accaaaataa gcaacatttt cagtccatgc cttatatata ttcaccaagc attacactag 4020

gcctccaact gctcatcgga gcaagctgca gcctggacac aagctagaga ttaatcagtc 4080 

aggaatgatc ctgcgtccag tgccagcatg atggaagaga cagagaaaca gaagacatca 4140 

gggctccaga gtcaaggagc ctgcaggtta gttgggcagg atatacacac atacacacac 4200

acacgcacac acaaaaccac ccaagaagaa aaggtgggat gaatgcatgg acaggtaatg 4260

cctggagcct ggggatggat aagctgactg caggtggccc aggcaggctt cctggaggaa 4320 

gaagacctgg ctgtangtgg ggtangcang ctttctaaat ggggaaaatc tggctgtggg 4380

tggagttggc angtttccga aaagaagaaa agctgactat gggtacacct ggctgttggt 4440 

ggaacangca ggcttcttgg aagaagaaaa tctggctgtg ggtggatcan gcaagcttct 4500

tggaagaagt aaacctgact atgggtggac caggcaggct tcctagagga agaagaccgg 4560 

ctgtgggtga accaggcagg cttcctagac agaggaagat ctggctgcgg ttagagtggg 4620 

caggcttcta agaagaggaa gggctgactg tgggtagacc tggctgtggg tagactgggc 4680 

aggcttcctg gaggaggaag agctggagca ttgaaaaaca aacatgactt ggtgaatgtt 4740

gagcatgccc aggcctgatc cccagaggca attacgcact caagttactt aattctactc 4800

acaatgcctc acaaacaact tctctgacac ctaacacagc tctgggcacc ttctagcttc 4860

agctcctcaa agcagttatt cacgctacta ccctgcacac ctcctcacac cccaacccca 4920 

gggacaggag ttctgccaga tgccaaagct cctgatgcca aagcctgggt ctgcttccgg 4980 

gctcctcttg gtctaactgt ccaccccgca tcggcatgat gtgcaaaaac aaggctttgc 5040 

aatctgccct gatgcctggc ggagcgagtc cctcccgatt cgtctccttc agaaacacct 5100 

gggctgccct ggtcctgtta tacccccaac acattctaca gtcagctccg caagttccac 5160 

aaagatcaac gctggcgttt ttatggcatt ttatttacag tttttacaat ataaaaaagg 5220 

aaggatgcca cagctcagcc agcaggacag acagagatct atgatgcttc tgctgcacca 5280 

ttgtttgtgg tcaagaaagt ctgttttcaa tgatttatta aattgtggtg ggagatggat 5340 

ggtggcagtg gttaccagca acatgaatgt tcttaatgcc actgaacttc acacttacaa 5400 

atggttacga cgataagtgt tatatgtatt ttaccacaat taaaaacagg taaatgcagg 5460 

ccgggcacgg tggctcacga ctgtaatctc agcactttgg gaggccaagg caggcagatc 5520 

acctgaggtc aggggttcga gaccagtctc gccaacacgg tgaaactctg tctctattaa 5580 

aaatacaaaa attagccaga tgtggtggtg catgcctgta atcccagctt ctcaggaggc 5640 

tgaggcagga aaatagcttg aaaccgggag gcagaggttg ccatgagctg agattgtacc 5700 

attgcactcc agcctgggtg acaaaagcaa aactctgtct caaaaaaata aaataaaata 5760 

aaaataggta aatgcaaaca tatggtatag taatattatg ggctattatg agctacaaaa 5820 

aagaatgact tgggactaca gttacagccc tcattcagga atttgtttta aatgtgggtt 5880 

ggtcgctaag gcatgtacac aacattttga cgttcaaata ttcctagatt tggacagtga 5940 

gcacccctct aagctggctc ttctgtccca gaggtcccca ccagtcctcc agaacttctt 6000 

tgctttctta cacaataaga tgccccatgc tcggcttgta cctttccttg ccccagccct 6060 

agaaccagct tcttcgtgga caagctctga ctcctttggg tggagaatgg tattcagaaa 6120 

cccagacctg ggctctggtg tgctcactgc tacttggggt cattgcttct aggcctctct 6180 

gctgatggag gtaggatata cacgtacagt cttccctctt cccagattcc gtacttgagc 6240 

tcgcctactt gctaacattt atttatatcc cccaaattaa acctcacagc acttctgcaa 6300 

tcactcactg acttgcagag tgtgaaaaaa ctgagtcacc atcacacgtt ccaaactgag 6360 

gtcaactgag gccacaacgc cccatcttct tgctccggct gtcgagatgt aagcaagtgt 6420 

ccttctctcg gtctagctag tgccatgctt tccacatcac tgtgcttttt gtgggcaatt 6480 

ttgctgtata aaatgtcccc tgcacatatg ctgctgtgta gtgctcctag gtgcatgagg 6540 

ctgccccacg ccttacagag agaatatgca tgagaggctt tattcaggta tgagttatag 6600

cgtagttggc catgaattca atgttaatga atcaacaata tacagtaaat aaggtgcttt 6660 

ttagagacag ggtctcactc tgtcacccag gctttagagt ccagtggtgt gaccttggct 6720

cactgccgcc tcaacctcct gggctcaagt gatcctccca cctcagcctc ccaaactgtt 6780



gggattacag gcgtgagcta ctgcactcag cctaaataag gtgtcttaga aacacacata 6840 

agacaaggtt atgggctgag tgcggtggct catgcctgta atcccaacac tttgggaggc 6900 

caaggtggga ggttcacttg aggccagaag tttgagacta gcctgggcaa catggcaaga 6960 

cctcatctgt atattttttt aaatcagaca ggtgtggtgg tgcatgccta tagtcccagc 7020 

tactggagag gctgaggcag gaaaatggcc tgagcccagg aggtcaaggc tgcagtgacc 7080 

catgattgta ccactgcatt ccagcctggg gtgacacagc aagacgctgt cttaaaaaaa 7140 

aaaaaaaaaa aagccaggtc aggtatcgaa cagttggcaa aaacgttgtg acctgaggct 7200 

cacaggaacc tagcccgatg tttcccctag gagcaatggt tcagtattca ataattcagg 7260 

gttcccagtg actttatgga gcataacttt caagaataac aagaaccaac tgtacgtgtg 7320 

tatgtatact cacactttta ttttatttta ttttattttt tgagacagag tctcactctg 7380 

tcacccaggc tggagtaaaa tggcgtgatc tcgactcact gcaacctccg cctcccaggt 7440 

tcaagtgatt ctcagcctcc caagtagctg ggattacagg tgtgccccca caaccggcta 7500 

atttctgtat ttttagtaga gacggagttt cgccacattg gccacgctgg tctcaaactc 7560 

ctaacctcaa gtgatccacc cacctcagcc tcccaaagtg ctggaattac aggcatgagc 7620

tgccgtgcct agcctacata cacttttata cacacatgca tctatgacta tttctctatt 7680 

tctgtgcatg tgtgcgtggc agtacctaca gtttcagcta tgtgtctggg tactgtctcg 7740 

tccaagtttg taagcacctt ctccaaagtg caaagcctgg cttgtgttac tatccatatg 7800 

tttacttatt tgctcaatca atttacttat tagctccata accagcttcc catctgctcc 7860 

agtagcctct gctgtcagtc acctctgcac cctaccccac cttgcttccg gatgctggat 7920

gccaatcacc cccgacacct ctacatagca ccaccctcga catgctgctt ctttatttct 7980 

tatttatttg tttgagatgg agtcttactc tgttgcccag gctggagtgc agtggcacga 8040 

tccaggctca ctgcaacgtc cgcctcctgg gttcaagtga ttctcctgcc tcagcttctc 8100 

aaatagctgg gattacaggt gcccaccacc acgcccagct aatttttgta tttttagtag 8160 

agatggggtt tcaccatgtt ggccaggctg gtctcgaact cctgacctca agtgatccac 8220

cttggcctct caaagtgctg ggattacagg tgtgagccac cgcgcctggt ctgcttcttt 8280

aaatgccagg caccaacatt tgtgcaatgg ggtgggagga aagaacaggg aggagagcac 8340 

actgccggcc cctgcactga atccactgat caatctgggg gcaactgcca tctccatctc 8400 

ctgtcttcct atccgtgaac atctactgca gtcctctcca atgtccttct gtaaagttgt 8460 

attatgtttt gcatacaggc cttgcatatt agttctcaga tataatccat atactttata 8520 

taaaattcaa accacattta aaaaaataaa actagcatga ctataacgga gtctgcaaca 8580 

ttctcacaga ctttatgata aaacatgaaa cttcaaagat acttagggtg gggcagggac 8640 

aatgtttaag gctgcctgga agcctcccca tccctgagcc agaaagtcct atctcccctt 8700 

caaggggaaa tgcttgaaaa agcactgatc aggctaaaat gacagggatc agggagtaat 8760 

caaagtacaa gtgagctggt ctcctccatt ctgagcacag caaagttcag tctctccaag 8820 

tccaagaatc atacacctgt ttgccaagaa tgaagttcag gtgtctacaa gtggctgaaa 8880 

atattcattg ctgggccatt aacaacattc ttggcaaaac cataccttag cttctcgtgg 8940 

aaatttctta aggtagaaga aacaggaaac acccaggctc gcttttatgt agacagttcc 9000 

atgaagccag ggaccttccc cacatccacg tttcaattac ctgcacgcag ctcacagtgt 9060 

attcaacatc tacgcgtctc tcctactggg gtggcggtgg ccactcaaac cctcatgcag 9120 

ctacgatgac cgcaattttg gcaacataat ttcatgtttt tccttgggct tttacccaag 9180 

tcagtgacac aattctgcag ttgtctaaag attcaaaatg agggacttga catttacaac 9240 

aataataaaa tcttgggttt cctttaacca agcacatgtt ctgcctttta gagaaagctc 9300

tgcaaactca agctggagtg ggatacttgc tgacatcttc aagcacccca ggaatagctc 9360 

tactccccca tttccacctt ggctgaacca tctatatccc accaattccc ccaacatccc 9420 

tccatccgtc catccatcca cccaaggacc tgctaagcca ggaggtctct cccatctacc 9480 

ccacagcctg gcctcagccc acaagggctc tctctacatg aatcccaccg caccagagta 9540 

gaccaagtct cccgtagact ccaccctgac cacctccatg cctccagcca ttcccacccc 9600 

taaaaaccct ccctggtctc tacacccagc tgatgaatac ttggctgaat gtgacctggc 9660 

ctcctggacc caggtgaagc ccacgtcctc cgtaagcccg ccagctcacc ctgcctctgc 9720 

accttcactg gagagagccc gcacttcacc tcctcagggc aggcatggct gatgccaccc 9780 

agtggaatct ggtgcaaagc agggcccggt gcagagcagg gctgcctgca gagcaaggcc 9840

ctggtgctgg ggccgagcac ctccaatgct ggccgtggaa ccatccctcc cattccaggt 9900 

gctgtctcca tcaagaatga gcgagctgct gacatttgca tgacaataat gaataaatac 9960 

catattttgc ttcaaatcca gaatagatgt ggccagggtt ggcatatgac tgttgggaaa 10020 

ggacagtttg cctcttccca aaccaacttg gattataaaa agcttttctt aacgaccaca 10080

agagcggagg agctcagggg cagacaaaag gaaggctggc tgcagaaggc gggagagtgg 10140 

ggccttcagg ggcgggtggg gagagagaaa gcctggagct gcacccccaa ggtctgtgta 10200 

catcaggtgc tacagaataa caccacctct tccagcttgg cccccacctg ccctctccca 10260 

gcccagtcac ccagacagca ccccactccc cacacacacc tcacatctgc ccgcctcaca 10320 

ctcaccagct tcggctctca atgcaacctg gaacctgccc ttggcctctc agctcagcca 10380 

cccccattcc tgttggcccc tggcccccca tcgaattctc tctaatccta atgcacacac 10440 

ttgcacactc aaacacacac acacacacac acacacacag cccagaggaa aaccataatt 10500 

gactgaggtc caggcaagtt tcccgagcag ggaccacatt tcaaaggtca gggaagcagg 10560 

cgaacaggaa acatacaggg ggcacgtttg ggggtggagc aggaaataag aaatcacttg 10620 

caaaagataa aaagaaaatg aggtagctgg tttcagacac ctcggagcac acagaacagg 10680 

acaggcgcct ccgggtcttc cctcaacagg gagatgggcc aggcaggtcc ctgctgctcc 10740 

accgcagagc tgggggctat ggccctgaca ccaaggccct ggggcaggcg gggaggcagc 10800 

tgttctcctg cctgtgctcc cgggcagggc ctggccccac aagggaactg gccgaaggct 10860

ctgcttggct actccggaaa gtcctgggag acaagcaaag gacttgctag gtcactccaa 10920

acggcccaga tgtgacaact gtgaagaagc cacaccaaag caaggtgaca gaacaatgtt 10980 

ggtgacgtca ggttatcagc ttacgctcaa ctccacttac ccggactcac ccgtaacctg 11040 

ccgtctcttc ccaaccagta aaggatgcct aggtagaggg gcacaaggcc tggagcataa 11100 

ttaccatttt aaaggctctg agaagtcctg cggtgaggaa gcctagttca ctttctctcc 11160 

cctaggattt cccaactgcg cctgatcaca gaacattttt tcatttccac tcaggaaaca 11220 

tattttgaaa aacactggcc tagaggcaga agtgaaatgg aaaacacaaa agtaaaactg 11280

aacaggaggc actgggcaga gaacggtcag aggcgccctg aatcctggac cggtggagat 11340 

ccccagcttg gcatgctccc ctccctgggc ccagaccgcc tccccccatt tcctggataa 11400

gaaggctaat gcgcatcagg gtgaagggct tgcctgggct acacccccag gctcgcccca 11460 

caccaatcgc gctcctgcga gagccagtga ctttcttgat ttggctactg tggaattgtt 11520

tgcaactaac caccccagat acagatacaa atgacaggat gatcagatgt aaaggaccca 11580 

caggtctctg tgatacggct tcatgcagcc agcatggcta gtgccgtgca gaatgagaat 11640 

gaccccaggc aagtccttgc ctcccagacc cagaacccca tggagcccac cagggctggt 11700 

tcacaagcac tgtctgggtc gggcagagat tccagcaaga ggagggaaca tccatgcacc 11760 

ggagccagtt accagaagca aatcgcctct tccaaaaccc aggctattaa tggagtccac 11820

tgttgagtgg agctggggtc tagctatgga atactgcaca gcagagatct tcctgagaga 11880 

aagcagtttt ccctgaaagc catgtgtcct ccactaactg tgttttaatt gggcgaacgt 11940

ctgtatctca ttgcagtggc cgcgcatgtg ctgacaaggg gctgggggcg gggtggggag 12000 

cagaagctca ggggcctggg agggaaggaa acaggccacc agggctcccc agaaggcatg 12060 

tatctctctc acaaacacac gcatgcacac acacgtgcac acatactctg caagccctga 12120 

gttagcaact gtggaatgtg accagctcag tgatcccagg acaagctgct agggaatatg 12180

acatttgatt gatgtctgca aatgtgcgtt ttcactaatt agaaggttta gggcagagca 12240 

gagaaaaata tgtatttcag agtcccagtt tgacctgcca gaaaccagcc cattactaac 12300 

attcttattt tcaacaaaat atagcattct gattacatac catcttggtt ccacgcctcc 12360 

tgccttgcca agcccccgga agcggcccaa ggccatggca aatagtgaga gaaacagttc 12420

cagggtggag actgactcag gggtgtcagt cagtggggcg ctgatggccg gtgggaggcc 12480 

agcagtcatc accctctcct tgggacagtt gagtagctct cccccagggt catgtggcca 12540 

ctcaggttca tatgggaggc gagaggagtg gcagagtcca ggagagtggc tccgaagtca 12600

ctgttccctc caggcctcag tgtcttcatc cattaaatgg gtaggctgag gtctgggatg 12660 

acaaggaggg cttgcactta ctgaaaccca tgggaggctg ttcgccgatt tcttttattg 12720 

atggaagaaa acactcgtat aattcaagta ccaattaaaa ggcaggcact ggaaccaccg 12780 

tctgccaatt cctagttttg cctataccaa atttgagcaa gttaattgac ctctcccagc 12840 

ctcagtttct tcgtctgtaa aatgagggta gggatggccc ccagcccaca gggcagctgg 12900 

aaggattaaa gaaatcaaac atctcttaga gcccacctgg cacactgtga tacacaacaa 12960 

atgttagcta tttttgtcta tgaagtctag attttatatc ttgggtgttc taaagcagga 13020 

tacatttatt taaaaacaag gattttcatt aaacacgtac cccacagaca gcaaccccat 13080

ggagactgct cttaattcag gccagtatcg aaacgactct aactacaagc tttatacagg 13140

tctcttggct gtccttcaaa tccaactaag gtggtacttc tgaagcactg tgcacatgtg 13200 

tgtgtgcatg cacacgtgtg ggaagggcgg gctcacggat ccctcaggta ccccacccac 13260 

gcagtctcaa gtcacaaagc gacagagcag ccgaggaagg tctgtgcccc actggaccct 13320

cgtgaagcca ccaactctac ctctgcgccg tgtcctgcag actgggctac cctttgggtg 13380

gggaccagca tttgatgcaa gaaaggcaga cagaaaagga aaagggcaag ttcgactcca 13440

gataacacag acagtaccaa gccccagggt ccataaatgc cacgcagatg gaagcattta 13500 

ctgcgaggcc acacagcaaa cgcacggatc cagggacgga ggtgcagact gcggtgcccc 13560 

tgagccatga ccctgcaaat taccaccatg ggaaaggagg ctgccaaacc ccccgacagt 13620

cggctgggct ggcacagact cgtggtttcc atcgaggtgg gaggaggtgg gacgtcccag 13680 

cccctccccc atgcccactg cagagggaag cggccgtttc ccctgtgtgg ttacaaaggt 13740 

ctcattgttc ttcctcacag ggaggaaact ggaggaccga gctcagaacg cattttagaa 13800

ctggcagaaa agaacatctg gggaaggaaa cacatttcag aaacaaacat acctttgtac 13860

cagcttttat tttctttaag tgttgaaaaa ataataataa taaagacatg ccaaatttat 13920 

catcgctcta caaaatccct ttattgagca aaacgtggca gctctacttt caaatgatta 13980 

ctgttcctgg aaaattgcag caacgtggat gccaaggccc gaaggccgcc atcagcagcc 14040 

aaacaaaaga tgccacctcg ggctccgcga cactgtacca tgccagggaa ctggacagat 14100 

ttggggaatg ccacggtttg cctttaaccc cttgcctcct ggtctcctga tgcatctcag 14160 

aggctaacat tctttgagga actggcattt cttagttgta aatatgcatg tgggtttggg 14220

agctgcctgc aaagtccagt gttgacgatc agctttgatt tccttggaat caagtttacg 14280

tgtcgagtct ggaagttaag aagaatttgg agaagctgag cactatggtg ttgcaggccc 14340

tgggtgaact cttccaccaa gcattcattg tggactgaca gcgtgcgagg ggctctgcag 14400

gcaggtgcac aggacgaaac acattccgtc cgggggaaac ctgcaggaaa gctccctctt 14460 

cttcctaagg tgccgggcct agcttcatgg gtccctaccc tccacgcctg tcacactttc 14520 

tgagtctcat gtgggagctg cttctggttc ctgacttcac tcagtcctca taggaggtgg 14580 

aactactgtc accccatttt acagatgggg agactgggca caaggggacc aagaaaccaa 14640

tgcaaagtca cacttgtggg atcagtgaca ggggagatca attcccaggt tctttctgca 14700 

agagttaaat tgttttcatg ctgcctaagg gggggcaact gaaagaccac tgcatatctt 14760 

tgccaaaagg gtcaagcaca ggagccgcag ccagtgggtc agatccgcag aggcgctggg 14820

gtgaccctcc ccatacctgg agggatgctt gtcccctcct ggccttcact gggtcccctc 14880 

atgaccgtgg cctcccagga cctcagcaca atcccggtcc tgtgctccag gacaagccct 14940 

ccgtccccaa gactgtgagg aaatggaacg aagaggggct cgctgcagcc cagcacccac 15000 

actgcccctt ctcaggggca agaaccgtcc tggaggactt ggctttggag ggggagcctg 15060 

ggaggccagt aagtcaacaa gcctctactg ctcatgggtg ggatcccacc gcaggccccc 15120

acctgctggg gcgggcaggg acgggcggca cagcttggcc agggcagata acccccacct 15180 

tggccagggc gaaggcagga cacgtgggct ccagcctggc cccaccatcc ctgcacaaca 15240

ctgggcaaag tccacgtttt cctcaactgg gtgttgacat ctgcaggaca ggggcatgga 15300

ggtacagagc gctgaagcca cacagcaacc taggagcgag actccatgcc tccccgggga 15360

cccctcccca ccatgaggac catgaaggct tcccatgtgc cgcaaggact ctggtgtgga 15420

gacacacgtc tcctacacag ccaggcctaa cgctcttgta actgggtggt cccacctggg 15480 

ctcacagctg gagggccagg agctcaaggc ttcgcagggt ctgctctcat cccagaggcg 15540 

atggggagcc acagcaggct gcaggagaga gggtgggccc cctccacttc agaggcccca 15600 

tctggcccac agactggaga gcacatctct cagcaaccac ggagcgccaa ctgcgcacag 15660 

ggcctggtcg tcagagcggg gcaaaggcac tgaccgtcac ggccagggcg agggaagacg 15720

ggtgggcagg gaccttgggc agagggggaa gaacctggtg cccaggctgg ccctgccttc 15780

agcagtgaag ctgagtgggg aggcgctgat gcagggggcc agaaagggct gctggtcagc 15840

cgggaggagc cccccacaga ggaagcagcc agcccagacg cagatggcag ggtcccctca 15900 

acaatgtcct ctgaaaagga gaggcgggga ctgctctggt gacacctaca aatagatagt 15960 

cagccctcag ccccctgcca tacttctgac aaagcagagg cccccagggg aggcgcaccc 16020 

gaaggtacct gcacctgtcc cccagactcc tagagcccac ctgaccccat cccaccaggg 16080 

ctccagctac aaaataaatg ccgaggccag ctaggcaagg acgcacactc ggtaccgact 16140

gaataggctc cacgttgtca tgagcgcaac ccacaggcca ccaggccaca ctatgcagag 16200 

ctgagatggt ttcggccaag cagcctctca gctgagctga acaagtccag agtccccggg 16260 

gggtcgtcac tatggagtaa caattgcgat gcgatggtaa ccctaacagc taaccgtcac 16320 

tgagccaggc cctgagctag gtacttttca acgctgcctc tctgcagcct caggacgagc 16380

ctgtgggagc ataaagatca ttccctatca cggatgggga aactgagctc tgaagcagtt 16440 

aacgtgcttg tcccagaccg cagagctagg agcaggacac aacagcaggt caggcaggaa 16500

cgggtgaggg gggcctgcat gggcttctct ggaggctgcg catacacgca acccccagga 16560

ccccgaccct gcacctgcag ctcgctactg ccccctcagt gactccagca aacctcgggg 16620 

taggggaagg aggctgggaa tacctcgggt gtccgaaaca gcagcttctg cttggaggcc 16680 

actgctgcat aatggttgct gcccagcaca ccccaagcca cctgtgccac ctgtggtgac 16740 

cttccagcat gccttggtga ccaagctggc cttaggtgct gtgggcagcc aagaatagaa 16800 

cagggcccac ccctcctctt cacactaaca caaagcaaga ggcgggcact tcgactgagt 16860 

gcatccctct agctcaaggg cctcacggat cacaggggtc agggcaagat cccaattctg 16920 

cattcccgtc tgcctttcat cctgctctgc caacaacagc cagtgaggct ggggacatcc 16980 

ctgaacctgt ttctcacctg aaacacatca taccattgga ccccagccct ccgggagagg 17040 

ccctaatccc tgactgtggt gagatcagat cactggttaa gtacccagaa gggccttggt 17100 

caggggctcc aggggtgggg ggtgatgggc gtggtggtat cccgctctgg gctatagtcc 17160 

accctgatgg aggaggtctg tggtcagaac cgggctgtgc agggcacagg agcccagagg 17220

gacccccaga gctcacctgg tggtctctga gcagggctcc ctcaaccctc agagaaaagc 17280

acagcaagga ggccgcccag agcccagcgc ctagcaccca gtggcgtgcc agacctgcct 17340 

ggatcctgga gatctctcat caccctccaa gtcagtcatg cccaacccag ggacccacag 17400 

cccacggggc cgtgaaggtg tgctgagtcc aagaaggcct tcgacactgg gaagccaagt 17460 

ggcacctcct ggtgtggagc aggcggaatc ccaccagcct ctgctctgcc agtgggcaca 17520 

gctggacgat gagcagaagg ggctgttgct taataaacgt catttcctta agaggataaa 17580

acctttcaaa acagatggaa attttttttt aattaaaact ggtggccaaa gagatggaaa 17640 

gcaccccttg tgcctccctc ccatcgtgac ccatcctctg cacacctcaa gctgttcgct 17700 

gcccaggtgt ctcctgaggc actgggggcg ggtgagaatc cgtgagccct cggccagccg 17760 

tggctctctg gagctctgcc ccaggccatc agggcacacg ccgggcaccc tgggggccac 17820 

acagggcaga gcccagctgg gtcagcacac agggccacac tgggcacaca agtctctgag 17880

cctcccctgt ggacgcagct ctcactatcc caccccacta ggtcccgggg atctgtccca 17940 

cagggtgata tgctgtcaca gaccactacc agagccatgg cctgctgttc cgcccgcagc 18000 

caggtagtca cttgctccac agggacaggc aacgccgcac ttgggggctg ctctgcggca 18060 

ggactagagc tccagcagct cagccctcct gagaaggaga actccatgct ctaagaggca 18120 

gacgcagcgg acggcaccaa agccaccaca agcccacggg gccctgcatg gcaggtcagg 18180

agtccctgac cactcgctct ttgtaaccag agctgcagtg gagtctacga ggcaaggact 18240

gtgggcggca gtggccacag caaatgaatg agtgtcccaa gggagcaggc ggctgcgggg 18300 

aggcacagcc gggacccagg agtcctccgg cactgcagca aactccctgg gccccctgag 18360 

cagcgaccag gtggcaagtg catgaactcc cgggggcata acctgggagg gtgacactct 18420

cttcgtgttc aaattcttga gaacgcatta aaaatatcac tcagtcacct actctatagt 18480 

tttaactcaa aagtaccaaa gtagccaggc gcggtggctc acgcctataa tcccagtact 18540 

ttgggaagct gaggcaagag gatcacttaa gcccaggagt tccaaatgaa cctgggcaac 18600 

atggagggac cccatttcta caaaaaaagt gttttaaaaa attacctggg cctggtggtg 18660 

tgtgcctgta gtcccagcta ctcaggaggc tgaggcggga gaaccacatg aacccagggg 18720 

aggtagaggc tgcagtaggc tgtgatggca ccactgcact ccagcctggg taacagagtc 18780

agactctatc tcaaaataaa tttaaaaagc accaagccag gcttggtggc tcacacctgt 18840

aatcccagca ctcagggagg ctgaggcaag tggatcacct gagtcagaag ttcgagacca 18900 

gcccagccaa catggtgaaa ctccatctcc actaaaaata caaaaattac ccaggcgtgg 18960 

tggcgggtgc ctgtaatccc agctactcag gaagctgagg caggagaact gcttgaaccc 19020

aggaggcaga ggttgcagtg agccaagact gtgctactgc actcaagcct gggagacaga 19080 

acgagactcc atctcaaaaa ataaataaat caatcaaaac caccaagact ttttaatata 19140 

aacatttatt attccataat tccttttttg catgattaaa aatgtttata taaagtttcc 19200 

tgaaaatggt aagaatgcca agtgaaggct gcaaatgccc aagcccccac cgtggcatct 19260 

cacggagtct gggccctagg aggctggtgg gtaccacgtg gacccgagac ttcacagtca 19320 

agtccctttg gggtacactg ggtttcccac accccagaaa tatgggctct tactgcagga 19380 

ccatgggggt cctcacactt ggcccagaag ctgtcacata gccagacagg tgttctacaa 19440

cctaggctag agggagctca tgctccagca gaattcgagc cagaggaggt aaaagatggg 19500 

taagatctgc tccctggaca gatgaggcct tggcctcaga acagttactg atcatctacc 19560 

agacatcaca ctagaggcag aggggcgcag acgaagacag cccctgtcct caaggccctc 19620 

ccaggttggg tggaccatgg aaggttccag acagatctgg caagagaagt gcccacacca 19680 

ggggcagaag atgggcaggt ctgctcaggg cggcacggcc tgccaggcca aaaagttcca 19740

acttcagatg ctggagaatg ggcacgactg tctgagaaag ggaaggatgt gatgaaaact 19800

acttggagaa aaattaatct ggccagagca taagataaat gggcaaaggg gaggttccag 19860 

aaagcaagga gaccaagtaa aagctgatgt cattggctct gaatctaggc tttcactgaa 19920 

tatgcaccgc agggcctgta ggtaaagcct cagagcccag ggagtctgag tggaggagag 19980

ggcaggggac agagctgggg cctgtgtcta cagtgctcag gaggaatagg catggacgtc 20040 

agctcggagg ctccagctga agtgaggagg cggccagggc agcacggcca cgcccggatc 20100 

cagactcctt ttgggaagca agttcgctct gggggaaagt ttggagaaat ggcctttacc 20160 

cgcagaagca agccccagaa catatcttgc tccaaaacta tctcgtacag tgaggacgtt 20220

aagcttcagg tcccctagag gagacagtct gctccttcct ggggcagaac ccaaggtggc 20280 

cagagcctgg aaggcaccca gcacccaggc tggtgtgttc cagcccaggc cacacgctca 20340

gatagctatt aatgccccgt tgagcaattt cctgagagct ttgccaggca ggtaccgcct 20400

ccccatctga actaatacag gggtacatcc caaggaagaa atgaaaggtg cccacatttt 20460

gctctgggat taactaggga ggggagtgat aattaactca gtaattatat ttgccatcgg 20520 

gctaatgcta aaattagtgt gcattagaat ttctttcctg agcagacacc ggagtgagtt 20580 

gggcagcagg agtggctcgg gcaagtcggc acaaagggca cctccagagc cttccacaaa 20640 

tgtcagcaaa acccacaaat gtcaaggccg gctccactgc acccagcaga tgaattcact 20700 

tccacagcct gagaccgcca gctcatcgga ggccatttaa aatccagccc tctgacacct 20760 

gctggatatc accatttacc gtccccagat caagagatca aagggtggaa cctgatagga 20820

cggctctgaa gttcaccaca aaagcataaa cgtgcaagca gagccaatac gtcttttgaa 20880 

aaggacaatg aggtgggaat ttacataact gatcttaaaa tatgttctga tgcttcagag 20940

atggagacag cagcattccg gtacacaaag acactcacag gcagtggagc acagtgaagg 21000 

gtctggaatc aggacccagg tgtctgtgga cactacacat aaaagagcag catttacaat 21060 

gaatggatag gatggaccat cccaccaagg tgttggacaa ctccctattc actggccaga 21120 

cccctacctc ataccatata caaaaaaaaa aaaaaaaaaa aaacccagac agaataatgt 21180 

ctgaatgtaa aacataaaac agtaacagtc ctggaagaaa ataatggagg atatatttat 21240 

aatctggaga tggagtaaca agggatagga aaaaagccat agggaaaaag tagagttatg 21300

attatatgaa gcttcttaat atctttatga taatgtacca ccagaaacaa ggatgaagga 21360 

ctagctacag accagcagtg aaacctgaaa caaacagaac aaagaattaa agtccatacc 21420 

aaataaagac ctcccacaaa tctataagaa aaagataaac aggctggcac cgtggcttat 21480

gtctgtaatc ccagcacttt gggaggcgga gatgggtagg tcacttgagg tcaggagttc 21540 

gagaccagcc tggccaacat ggtgaaaccc tgtctctacc aaaaatacaa aaattagcca 21600 

ggcgtggtgg cgcatgcctg tagtcccagc tacttgggag gctgagccag gagaacagct 21660 

ggaacccggg aggcagaggt tgcagtgaac caagatggca atcgcgccac tgcactccag 21720 

cctggaggac acagcgagac tctgtctcaa aaaaaaaaaa aaaagaagaa gaagaaaaaa 21780 

gaaaagaaaa agacaacaga aaaatgggcc aaggataagt gtaggcaatt tgcagaaaag 21840 

taaataccaa taaaccagaa atgagggttg tgcaaatcaa aaggtgttat aatttttaac 21900 

caaactggac caaagaaaac accaaaaacc aaaatcttgt aattgccagc atcagagagg 21960 

atataggaaa gtgtgtgttc tcgtagatgc ttgcaggtat gaactgctac agccttttag 22020 

gagttatgta tgtatgtatg cttgtatgta tgtatttgag acagggtctc gctctgttgc 22080

ccaggctaga tctgttgcag tgctgtgatc atggcttact gcagccttga cctcctgagc 22140 

tcaatagatt ttcccacctc agcctttcaa gtagctgaga ctacaggagt gtgcaatcat 22200

actcagctaa ttttttaaat tttttgtaga catggggggt ctcccaattt tgcccaggct 22260 

ggtctcgaac tcctggactc aagtgatcct cctgcctcaa cctcccaaag tgctgggatt 22320 

acctggatga gccactgtgc ccggcctcaa tatctttaaa aacagaaatg gacacactct 22380

ttgactagga atgtatccta taaaaacact tatacacatg cagagacaca cgagcaagca 22440 

tgctttgtaa tagcaatgaa ggctggaaaa actcctcaat caggtaaatg ctgtcaagtg 22500 

cacctgtgta ctatgaaatg gcacttggct tttaacaaga gcaaagacag aaaagcaaaa 22560 

gtacaaagta gggtgtgatg gcacatgcct gcagtcccag ctactcagga ggctgaggca 22620

ggaagatcct ttgagcccag gagttggagg ccaggagctg ggcaatagtg agaaaaaata 22680

aaattaaata ataataataa taaaataggc tgggcacagc ggctcatgcc tgtaatccca 22740 

acactttggg aggctgaggt gggaggatcg cttgatccca ggagttcaag gccagcctgg 22800

gcagcaaagc aagacaccca tctcaacgac aaattttaaa aaatcagcca ggcaggctgg 22860 

gcatggtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcagg cagatcactt 22920 

gaggtcagga gttcgagacc agcctggcca acgtggcaaa accctgtctc tactaaaaat 22980

acaaaaatta gctgggcatg gtggcagatg cctgtagtcc cagctactga ggcacaagaa 23040

tcgcttgaac cagggtggca gaagttacag tgagccgaga tcgtgccacc gcactccatc 23100 

ctgggcgtga gtgagactcc tgtctcaaaa aaaaaaaaaa aaaaaaaaca aggagccagg 23160 

cacggtgggg tgagggaggg cacagaagca gcgcctcttc tgggggcacc cccaatctct 23220 

agcgatccag aggcctcagg atcctgaagg gagaaaaaac gtgaagctcc gtgctagaag 23280 

agaccataga gattggaatc agctggttct attttacaaa aaaaggaaac tgaggccctc 23340 

agaaggtgag tgcctctcaa tgccccacag ggaggcaggg agagggctct gagccctgca 23400 

gggccctgga ttcttgcaat ggggtggagt ggagcctgtg ccgcccccac caggcacctt 23460 

ctcaggagag gagccgttgt catatccttg aaggggtcct tgagcccctc aaaaggctaa 23520

aaaccacttt cctccttgag tgaaccttca cctcagttta accacaagaa aaactacatt 23580 

aaggcccagc gcagtggctc atgtctgtaa tcccagcact ttgggaggct gaggtgggtg 23640

gatcgcttga gcccaggagt tcaagaccag cctgggcaac atagtgaaac cctgtctcta 23700 

caaaaaacaa caaaatcagc tgggcgtggt ggtgcacacc tgaggtccca actacttgcg 23760 

ggctgaggtg agaggattgc ttcagcccag gaggtagagg ctgcagtaag cggtgactga 23820

atcactgcac tccagcctca gcaacagagc aagactcaaa aaaaaaaaaa aaagcaggcc 23880

gggtgtggtg gctcacgcct gtaatcccag caccttggga ggccgagcgg gaggatcagg 23940

agatggagac catcctggct aacacggtga aaccccgtct ctactaaaaa tgcaaaaaat 24000 

tagccgggcg tggtggcggg tgcctgtagt tccagctact caggaggctg aggcaggaga 24060 

aaggcgtgac cctgggaggt ggagcttgca gtgagctgag atcacaccgc tgcactccag 24120

cctgggcgac agagcaagac tccatctcaa aaaaaaaaaa attaaatctc aaaaaaaatt 24180 

acattaaggc aaactaaaag atgtttaaaa tatatatatt aaattaaata cactccaata 24240

gagcaaatac gaaaataccc agaaaacaca atccccgcac ccccaggaca acctcccagg 24300 

gggtccacag caagagaccc caagcacgag agacagagaa cagtgtccct gtggcggaac 24360 

ctctggccca tcaggctcta ttagaaaata aggctcttgc cactgagaga aagaggcaca 24420

gtcgcccagc agccacgggc tctggcacac cacgagtcag gccagcaaag tgtcaactgc 24480 

cccctacaag gtgacaaact aggacaaact ggaaaccaga ggctggacct ggagcacagg 24540

gaccaccaca tggggctggg gaatgggcag ggacctcaga gcgccaccca catgcctaag 24600 

agcagcgcgt atgcgcatgc ctctgcatgg cttagggaca cagggagctc cccccacccc 24660 

caacccagga aggcagcccc cactacccag gtagggaacg gataggacca gcaccccgtt 24720 

ctgctcgtaa ctcagggctc caggccccct cgggggcaac cagcacagag ctcagacccc 24780 

aaatatcttc acccacctcc tggtccccat ctggacaagg gtgctgggga ctggctctca 24840 

gtcacaccct cggggtactc ttcaaaggac agctggatgc cccagggcag gagcttttgg 24900 

cccccagctc cctcacccca gacaccagct cttgggaccc caccagcatg ggcaaggtgg 24960 

acaccatcgt cccgattttg cagatgagga aactgaggct gagggctggc acacggctct 25020 

ccagagctga agagaatgca gagagcagcc ggagccagcc ggtgggtccc tgaggccggc 25080 

tcgtagcaag ccacagctgc ctccgcccat cacacttgga cctcactggc cccaggacag 25140 

ccctccaggg cggcctggca cagagcccac accctgctgc ttcctgaaca aataagtgaa 25200 

caaggccacc aagccgagga cctggatgta gccccggctc ccgccagggc ctccccaaca 25260 

gactccccat ttggagagcg cattaagtgt ttccaaagcc tcacaaacca cagatgtccg 25320

gctgtctcac ggcttctgta acctgaactt ggccctcact ctgccctccc agcactcctc 25380 

tcagggccca ggcccctcct ctgagatgcc agcactgact ccccaacttg tccccatcac 25440 

ctggctcgtt cctgaacctc ggcaggagag tctcaggcca gatcctccca ccagccacct 25500 

ccaccaggat gcaggaggca tgagacctgc tcgtgccggc tgggagatgc aaccaaccaa 25560 

gatcaatcca atcagcggat gaactgacaa atataatgtg gtccctccac acaatggaat 25620

attattcagc cacaaaaagg gctgaaatag gccgggcgtg atggctcaca cctgtaatcc 25680 

cagcactttg ggaggccgag gccggcagct cacttgaggt caggagttca agaccagcct 25740 

ggccaacatg gtgaaatccc gtctctacta aaaatacaaa aattagctgg gcgtggtggc 25800

gggcacctgt aatgcaagct acttgggagc ctgaggcagg agaatcactt aaacccagga 25860 

ggcagaagtt gcagtgagcc aagatcgcac caccgcactc caacctgggc aacagagcaa 25920 

gactccattt caaaaaaaaa ataaaaggct gaaacaccca tacgtggtac tacttggatg 25980 

actcctgaaa acgttacagt aaccaaggaa gtcagccacg aagacgcatt gtaagattcc 26040 

cttcatgcaa aatgcccaga acaggcagaa ccacagaggc agaaagtcga ctggtgttca 26100 

ccaggggatc cggggagagg gaacgggaag tcaccgtgta atgggtatgg gttttatttt 26160 

ggggtgatgg aaatctctta taacttgata gaagagaggg ttgtaaacac tgtgaatgta 26220

ccaaatgcct gccttctata ctttaatatt ttatattata taagtttcac ctcaatttaa 26280

aaaaaaaaca actcgacacc tttcacctag gaaagatctg gctttagctt gcatttcctg 26340

taactcctgc ctaaagcctt ccagaagctt ccgctgcctt gtggatcaca accagactcc 26400 

acaccatgat ctggcctcta agggcctctc gcaggacacc ccgagggtga aggagcaccc 26460 

gtgggcccac ctctgcatag ctgcaaagct tctttccctg tcctcccctc tacatgggaa 26520 

gctctgcccg caggggcggg gccttatctg ccattctatc gcactcaacc ctagcacttc 26580

actcggtagc agacaccaaa gcaaaacagc aacagcatta taccgggcca ggtgcacgtt 26640 

aactcactga attcatggta ggaaggattc tattcccatt ttacaggtga gaaaactgag 26700 

gcacacaaag gtagcatcag cttcctaagc ctcccagcac aggaagcggc caggctggaa 26760 

tcagaccctg ggcgcagggg ctctgtccac agtgctaact aactactcct gcccccgagg 26820 

gctgcagcgg tgagtgagtg agtttgtcag tggactggat gtccaaggtc atacaggaaa 26880 

aatccagact attgtaataa cagcctctag accggctggg gccagaaaga tcgaggacgc 26940

tgacacacaa ctgcgctcac tgcagctctg ccagggatgg ggctaaaggt ctcacacagg 27000 

gcagttaggg ctccccatag cctgggagag gaacggggtg agataacaga aactaggtat 27060 

ggtgcccgaa gtcaaacagc cactgagcat gtaaacccag gtgggtctga ccccaaaccc 27120 

ctccaccccc atcagccctg caacccgtcg ctgcaaggga gaaagcaact cagaggcctc 27180 

acctgcctac atcccccacc cgtgtgtgtg agttctacta aatgcctgag cagtgacaca 27240 

gcacggctga aattaaacgg gttccaaaaa cgacaggaag cacgaagtga atctccccag 27300 

gaaagtgctg aacaaatgct ggatcgggtt caccggcgaa tttcttggaa ctgaagaggg 27360

gagctaaaca cacggggccc tgctttggag gggactctct cagggtgctc cacacagcac 27420 

ttggttaacc ccactcagcc cttctgggct ctcccagagg gcccggcctt ggccttgggc 27480 

atctacagga ggaacctcca gggggagagg gggtgcctgg acaggccggc cctggaacaa 27540 

gcacttgggc cccgaggaga gaggactagg gcttgggagc tggggaagtt ctcagcactg 27600 

ggaccactag aacaaagcca tttccgtgcg ttcacagctt ccaattgcaa caggaagcaa 27660 

tcaggaaaaa taattagcgg cccacttact ggcttcgctg aggtccgagg catgtatttc 27720 

acacagtaaa accagggata taacatcaaa accgttctgc agaaagattc ctccctttcc 27780 

ttccatttta ggcctggatc accacattca ctggggctcc caggccttgc tgcctaatgt 27840

taaaataatc aactctattt ttgcctcaca cacaactgaa ctctacagct ataattcttt 27900 

ctcctcaggg gctcgaacca catggacgac aggcatttga ctccagcaac atcaccccaa 27960 

aacgtgcaca aaacccaaaa ctgcaatgag gtgaaaggca acgcggtcgg cctagaaacc 28020 

ccccctttaa aacaaacagt ttccccaaaa ccccttttgc ctccttgacc caggcatttc 28080 

cggaaaaagg agcggcgctg gcctgtactc cccagatact gtcgctgttt tgtcttcacc 28140

ttgttttgct agctccagac aaggccccac aatgtaaaca cgctcctgaa agaggcagat 28200 

ttggggtgaa actgtccata gaatctctag gcttgggtca gaggcaggag gacgtgaaac 28260

aaactccaag ctcctcctgt tccccgctgt cccccacacc tccaagcaga ggctgcagcc 28320 

tgggggatct gactacaggg ccaccccgct gcaccattca cactggaaat attcagggag 28380 

acagctgttt gccttaagga ggcccagaca aaggggcccg aggtcctccc cgctaaactg 28440 

ccacaaacag aacaggagcc gcggcgtgca caggcacttg cggccgtgcc acttggccag 28500 

ccatactcca gaaaaacaaa acacgcacat ccgaagagaa tgatttaggt agcaagaggc 28560

ttgcttgaaa aaccacatgg caatctccaa attaaaagaa catgtgtagc gtttcacgac 28620

tgcttaagtt tcctgagtcc tcctgacctc aactccaccc cctgggaaac accaaaagtt 28680 

ggatgagaaa gttcccccgc cctacctctc cccacgggag tgtacaactg aggcacaagc 28740 

ctgcctcccc cactgccccg cgatctggga ccacgtctcc tccgcgtagc cgacccgggg 28800 

atggacacta tctggggacc cggcggccac acggggcatt cgggtcgccc gggcacctgg 28860 

caggtgtcag tccgcttgga aacccacagc cacgcggctc acaggagcag cgccaccggc 28920 

taggccgccc cgcgcccggg ctcagaactt tctcgctgcc acttcagccc gtcctcggag 28980 

cacgcggggc ggccgcgcgg ccgctggaaa caggcttgcg aaccggctcc ccgggccagg 29040

cccgcctccg cgccccaagt ccccgctcgg tgcccggccc gggccacacg ggcccagcgc 29100

gggctcggct cggctcccgg cttcccgcgg gctcgggcag gtgaggaccc gcccgcgccg 29160 

cacctggcgg agcgggcgcc ctcctcgcca gcccgggacg cagcgtcccc ggggagggcc 29220

cgggtgggga gacaaagggc ccgcgcgtgg cggggacgcc ggggacggca gggggatccc 29280 

gggcgcgcgc cccaactcgc tcccaactcg ccaagtcgct tccgagacgg cggcggcgcc 29340

cgcgcacttg gccgcggggc cgcccgggcc attgtccgag caacccgcgg cccgtcttac 29400 

acgccgggcg cgggaaggta tcgaatcagg 29430

<210> 8
<211> 33769 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> unsure 

<222> (33739) , (33749) , (33758) 

<223> The identities of the nucleotides at the above locations are unknown.
<400> 8 

cttcccctta cactggtcct tcgacccgcc tcggatgaaa actgaatggg tttagcctta 60 

gaggctctcg gtctctaagg gaggtgggtc aggatgccgg ggacagggtc ctcttcctgg 120

ggcaacgtgg gggaacgagc cacctacccc tccactgaat tgccctgggg tgtgggtacc 180 

gacggctcat tcggtgtcca gggtctgaga tgtgttgaca ggaagaatga aaggggatgg 240

gagggatggg gcgaaagaag ccacctgcag ccccaggaac tatctggcca gcacaccgtc 300

acccagcggc ctgagccacc cctgccagag ccaggaggag accctgccaa tgggtcacca 360 

gtgtgcagga actcagaagg tcatcacagt taataccctc catgccccaa tgtgggaaaa 420

caggtttttt cacaacaaac aagataattt ttgttatttt ggcaaaagga ggcagggcag 480 

ccccggacac ctccatccca cctcatcacc cagccgcagg gccccggcca tccctgcaga 540

cagagtggat gtcacaacct ccctgcaccg aaccaagtgc agctcccagg ccacaggcca 600 

cccaggaaag gtccagtggc ccccggaggc tcccaccgca ggcctcccac cacagccggc 660 

accaacccag gatagctgtg ttctcctggc ttcttttcac acgggtagca gaaagctgag 720 

atccggggaa agctgagatc cagggaaagc tgagaatcgg cctctgctgc ccggacgccc 780 

acccccagct ctgctcccag ctccagggcc tccttctcag gtgcccttac aggaggcaga 840 

gggcttgagc cacctcctgg gcctggggca cgcaggatga acggggtcac ggtgcaggcc 900 

actgtccact gcgcagatcc caaggccata aacagcctgg ccacagtggc ttcccagctg 960 

gcaggcggcc agattatttt tgttgtttag caattgatta agtttctccg ctgcccccag 1020 

gggtaagtgg tggggcaaat gccgcaaccg cagcatttga cccgggatcc tgtgccaagt 1080 

gaccataggg tcacaaagca caagggaagt ggctgggccc gatgctggct ctgctggaac 1140 

ctgaggccgg ccactgtcac ctgcacggtg cctgggacct tccagcaagc acagagaagc 1200 

tatggccctc caggagcagc tggcaggcac cttggcctgc agtcaggggc tctgtctgct 1260

cagctctaaa acaggaaagt cgctgctctg cctggggtca gggcagccag agagtgacca 1320 

agtcagtgcc ggcctcagga agggacctgc aggcgggtcc cttcctctcc catccctcgg 1380

tgccagccag cccctcctgt ggccccccac tgcctgcctc tgcccccatg ccccaccaca 1440

acctcaggcc catggctgca tggccactcc ccaggcaggc agtggggatg ggatttcacc 1500 

atgttggcca ggctggtctc gaactcctga cctcaggtga ggagttccta aagtgctggg 1560 

attacaggcg tgagccaccg cgccagccct ccctgtggta ctaaacactc acaccccctt 1620

gctggggacc ctggtgaggg aacacagcct cacaagtgaa gtgtggtttt gttgagcaaa 1680

tgacgcctgg gcagccctct catctttgcc taaaactgaa gaatttaggg gcgtggatgt 1740

ataaaacagt tggtgactta aatgaaaaag aaggccacac tccccccttt aggcaggcgg 1800

cctaattctt taaaagccag cacagggtgc ctttctgaac ccaggcacac agtaggtgtt 1860 

caatggacag cagcggttac ttgtactgct catgacaccc tgtctgtggc ctctgcagct 1920 

ggctccagcc tgacgcatgg ctgcgcccct ccgcaaggcc accccggtat acatggaaac 1980 

tctgtggaga aggccttggg ggccggccag gacgccaggc ccagatccca tctgcgccct 2040

tcctccatag acctcagcga gctctcggca ccatgtgcct caggcccatt taagaagtag 2100 

ggccggccag gcatggtggc tcatgcctgt aatcccagca ctttgggagg cccaaggtgg 2160 

gtggatcacg agatggtcag gagatcgaga ccatcctggc taacacggtg aaaccccatc 2220 

tctactaaaa atacaaaaaa taagccgagt gtggtggcgg gtgcctatag tccaagctac 2280 

tcgggaggct gaggcaggat aatcgcttga gctcagcagg cagaggttgc agttagcgga 2340

gatcgcgcca ttgcactcca gcctaggtga cagagagaga ctctgtctca attaaaaaaa 2400

aaaaaaataa aaaaaagaag cagggccagc cacggacgac ccctcacaca gctcccagga 2460 

cgcgtgcctg ggtatagggc tcaggaccat gaccgctgca gtggccccca agaaacgtta 2520 

cttttgtcac ccaccccgcc tcagtggcag tagccaaaat aacggattag aatggaacca 2580

tgtgacaatg ccactgcccc aactgacaga agatggctat cagcagttca cgcggcccca 2640

cctatcacaa gtgcagggca ctctacaact tatgcatcct tccccagaca ccgtcctttc 2700 

gaccctccca ggtcagcaag gcacacaggg cctacatttc acagccacac agcagagggc 2760 

tgaggctgga actcggatgc tctgatttcc gttcaatcac atccccagag gtggcacaga 2820

gacggggggc ttctcttgac aaagtcaaga aagtcactgc cagctccact gaagaccaaa 2880 

gaacctcagc tctcaaaccc tcttgaaggt gttaccgaac tctcccagcc tgtttcctgg 2940

gtcccgatgt tggtcccgtg ggacacagga agaggaagaa gctccctaga gcagagcctg 3000

gtgcacctgc cacactctca gagggctgcg cacgggcgga ggagccgtgt gcaggagtgg 3060 

ggtctggatg gaggggcgct gtggccgggg gcagggggca ggggaagggt gctccaggtg 3120 

gtgggcacag cacgagcagg ggcagggagg tccacactca gatgtgcaca gggagaaaca 3180 

aatcgtgcat ttccattgga ataggcggta aaaggtagaa aaacagagtg ggggccagga 3240 

agggagtcgg agccttctag tgtctctctg caggtgagcg gcagcccgag gtgtcagctc 3300

agcagacttg gggtccaggg gccgtgtctt ctatcactga ccccagggca cacggaactg 3360 

gggagggaga gcagaggcac agggcacggt cagtgaaacg aaacaaggag tcatcaccaa 3420

atgcggaaag ggcaaggagt gcccgcagcc gcacaagggt tctgtctggg caacgtgggc 3480 

gtcccaccag gccccgcacc ctgcaagcgc aaagctcgcc actgaagata aagggaagct 3540 

gttggagctg cggagctggt ctggggtccg catggagctg ggcttatgct gcagtcacaa 3600 

gggggacatg gaagaggctg caggggacaa aaccagtgac cacagtctaa ctctgagcct 3660 

gtggaaaggc gcccacagca ttcacccatc ccagagatgc cattccccct gtgcccccgc 3720 

tccacggtga cagcgttctc caggaatatg atgcgcccct ctcctcttgc atcagccctg 3780 

acagtgagta ttcaggccaa aaagcagaag agcacagctg cgtggttcca tttccatgta 3840

gttctggaac aggcaacgct aatccaaggt gatagaagtc aggagagtgg tggagggggc 3900 

gggggttgag gatggcaaag gggcaccggg aactttccca gtggtagaaa tgttctctgt 3960 

ctggaccgtg tggtagttat gcagacatat gcagctgtca aagttaatcc aaatgtacac 4020 

gttaaaatgt gtgcgtttta ttgcctgcaa gttatacctc aattaaaaaa ataaagttag 4080 

cactcaggct tcttccacaa cttcctgaac cgtgtgagct gattttcttg ctattaaaaa 4140 

ttcacggtcc atggctgaga acagcagctg ccttctgttt gcaaagtcaa cgccaatcac 4200 

tgcccggccg cggcagactc ggccccacag gacctccttt cttttttccc tttgacctac 4260 

ttccctgata agtgacaaga cagccagact ctgggaacaa acgcccgtta ttcggccccg 4320 

agctgagcgg gccctgcttc ctgagctaat ccgcccggac agacggaggg acgtgagggg 4380

ctttgccgtc ggctccagct gtcagtctgc ccgtcagact cgacagtggc cccctctgtt 4440 

cctcccgctg cccccactcc atccccgact tctttttgtt tcctgtccct gacagacgaa 4500 

catctgttaa aactctgtct gggtgagctg tggccagcgg cccacaaatc cccaagccgc 4560 

accccagcct catctgggcg ctgccgggag cactgcctgg ccaccctctg gacatagctc 4620 

tgagagccac cggccagggc acgtgtggcc cgagtggcat ggtgcacgcc gctaagccca 4680 

ctgcccaaag gcccccaagc aggagggatg tgcaggagac aaaagtcaaa agaacagggg 4740 

cacgttccac agaggatggg gctggagggg tggcagtgag gaacagcagc ttccgaggat 4800 

ggcggtggca actcccaaat aaggcctcac tcctgctgtt tttagctcat tccacataat 4860

tggaaaaaca tggcagaaac cgaagccagc tgctgccttg gtcctggggc tgtgtggagg 4920

gggtggggag gccggaggcc caggctctgc actcgactgc tggggatgag agtgactctg 4980 

agctgcagag agcagcatcg cagccgccat ggtcccattg agccccggcc acgctgggcg 5040 

gcagaggctc gtgggatata cctgccctgt ctcatggggg tcacttcagg aggggcgggg 5100 

gagccaggac acagcccagg gctagcggtc accctgcagc tcaggggcca cgtaaatagt 5160 

gccaccttga aggcacacag cagtgcgggg ccccccccgc caccaacgca tccctacctc 5220 

taggaggccg cctgtgtgcc cctgggaacg ctgctccctg tcccttgggg tcctggtgtg 5280 

accaccctct cagccccttc cttggggaag gcacctgact ccctacaccc agctggcttt 5340 

catttgctca aaatcaggaa aaagcagaat tcaagacatc acagaaatgt cttcgcctgt 5400 

aactccatga aagataaacg gtcagacacc caggagggag tcccagggac ccttgagtct 5460 

cacctgaggc tctggcttca aacctcgaga tgtttccagc catgctagcg ccgcccccca 5520 

caacctgccc cacacagtcc tcccttggga actcacagat ttggccccca cctgccccgt 5580 

ttcttctggt ggagtgggtg cgttgggttg gggtggggct ggggactctg gatgtgtctt 5640 

aagagtctga gtgattctga cacagccagg ccctgccccc ctcctgacct tcgccccaca 5700 

ggaaagggag ccacacgcct gaagcgccca gcacaccccc ctccgtcctc cccaggtcac 5760 

ccgctggccg tgtgagccgt gctccccact gccccttcac ccaccccagc tcctcctggc 5820

agcacccagc cttggaagct acttctgatt acaaccgccg aaggaagact cgctccctcg 5880

gcactgaccc agacagcctg caccatcacg ctgctcagca caacccacac agccttcctc 5940 

caaaccccat ggagcgggga gtataatcac cccctttcta ccaacggaca aactgaagca 6000

cagagaggtt aagtcacttt cctaagctcc caacacgatg acaaaaaata gaaggtcagc 6060 

ccgcaagtgg aactaggtgc tccaagtccc cggtctgcct gacactgcac ctcctcgccg 6120

ccacggtccc gggtccgcct gacactgcac ctcctcgccg ccacggtccc gggtccgcct 6180 

gacactgcac ctcctcgccg ccacggtccc gggtccgcct gacactgcac ctcctcgccg 6240

ccacggtccc gggtccgcct gacactgcac ctcctcgccg ccacggtccc gggtccgcct 6300

gacactgcac ctcctcgccg ccacggtccc gggtccgcct gacactgcac ctcctcgccg 6360 

ccacggtccc gggtctgcct gacactgcac ctcctcaaca ccaccacggt cccgggtctg 6420

cctgacactg cacctcctca ccaccaccac agtcccgggt ctgcctgaca ctgcatttcc 6480 

tcatcaccac agtcccgggt ctgcctgaca ctgcatttcc tcatcaccac ggtcccgggt 6540 

ctgcctgaca ctgcacctcc tcaccgccac ggtcccgggt ctgcctgaca ctgcactttc 6600 

tcaacaccac tccttggccg gctcccaact acaaaccaag ccatgtcttc catcctgaat 6660 

cctcttggcc taaacatcac tcacaatgcc tccctcggga acaggcacaa gtcccaccag 6720 

cacagcctcc ttcgttacct gcgtttccgc tagcccaggg ccagctccag agccctcacc 6780 

acagagcctc tatccttcac ccccggacac tggacctcac caacccatag cctggaggag 6840 

atccctgtgt gaccccaggg cctcctctgc ccgactctga atttcactgc ccaacgtgac 6900 

acctcggaag gctctctggg cactggcagc cctccatggg caccgctcct tctggccagc 6960 

tctgacatcc cggctggtga ggtgccctgc acgaggcctc tgcccactgg gacctcacag 7020

ccgtgctgtc agctgcaaca agcgacagaa tttcacgttt tcttcacgtt gcccctgggt 7080 

gagcagctcc aggtagtttt cagtcgaggc gaggcgtccc gtcagcagcc aggcggcaca 7140 

gctaattcat gcccgccggg cgcacggccg caataccaat gggcacctgc agcctggaaa 7200

gccacagagg aaccgagaac agcgactgtg ctcaggtgac aggactgtgg tcttttaaca 7260

aaacattttc ctttaacgtg atattttacg gcaaggaatg aaacctggag ggcaggacat 7320 

ttggatacta aagccccagg ctgccgcgtg gtctgctttg tgaagtctga agcccgcgcc 7380

ccattctggc cccgctcaca ggtccggctc tgactcacca gcttcaatgc taggccgtgc 7440 

ctgtcctcca accagaacat gacttcctta aggacaaagc cgtttctcgc ccatccccat 7500 

ctccctctgg attaagaaat atgggaagat cttctagaac cacctcaaat ttgcagagag 7560 

ccatcctggt gacaaaccct tgaaatgctt ctaagaagag tttaggtttc ttctcaactc 7620

taaaacctct agaaaactct atttccacac cagctgcccc tggaacactt cagcttcaaa 7680 

agggcccagg gcagggagac ggaggagcca gcatccacac cgagcaccag cctgttaatt 7740

aacgggaagc gggtggggcc catctccagg cagctctgag gtcagactgg ggaaccatgc 7800

ttacaaaaaa aagtgaactg aaacgctcac gtcctcatgc aaaaccagac tcccagttgc 7860 

atctttctgt ctcattgagg agctttttcc tccctttgac agaacaccct acacacggca 7920 

tctggaacca aagcagaaag attcaggctc agagtaaaac agtccccaca ctggctgcat 7980 

gtggacgttc ccggcccaga gtctcgccca agcagggcct ataaatgaca caaaatgttt 8040 

ttctcctgcg tgccagtcat gctccaactg agttatgtgt aaaagtgcct ctcacggctg 8100 

agggcaaaaa cagttcccac aagactagag aaaggtgacc cctgacggct gagtctctag 8160 

ggagcgtgga gctgcgtgct cagccctgcg gccctgacgg ctctggaatg gaaaagctat 8220 

ccaactggaa gggcagggct cgctgctagt ccagcggtcc aaccccacag gtgtctgtgg 8280

tgtcagctcc atgccacaga gcccagggct ggggccagag ccaccaggcc ccctgccagc 8340 

ctgcaggggc ctcctcctct gggtagccta accaccccct gtgagcgcag gcagcctcct 8400 

ctaatcacca cagggcctgt ccccccctct cccccgcttg caggaaaatg agccctgagg 8460 

actccccagg gctgctctgg gcctggacat ggagactggg aattacattt gcagaaggag 8520 

cgcaatgccc ttgaagggct cagccacgag cagccagtcc ccagggctca gaaggcccag 8580

ctgttagaac cctgggagcc agcaaagagc caggggctcc acctaagtct atagcccctg 8640

cctcttctgg ttgggaaaga aatcaacgcc cctttactgg ctcccactga cagcccactc 8700 

ccccaggtat gggaggattc tgggacgatg caggcaaacc tggaccctga gtgaacctgc 8760 

cccagctctc acgggcctgg caccagccac agcacctaag gcgccggtca tggtgacaac 8820 

atgaaggtga taagggcatg gacagtggac atggcagctg gacactgggc acccactgga 8880

tgccaggcac ccagcacggc tccgtcaccc ctggatgagc agtggccctt tgcaagccag 8940

ggtagcctgg gcaagttatt tgggggtctc caagcttgtc cagctgtgcg acttcactga 9000 

gccatgagtc tgggatttta tcagggccca cacccgttcc tggaactctg atacgtgagg 9060

gagccacaca gggaccctta acaaaagctc ccagggcaac atgttctctt gcctcagtct 9120

cccaaatagc tgggattaca ggcgcacgac taccgcccgg ctaatttttg tatttttagt 9180 

agagacaggg tttcaccatg ttggccaggc tggtcttgaa cccctgacct caaatgatcc 9240 

ttccactgtt agggcaaggc acctgacagg cacgactgca cgatctgctt gttgggggct 9300 

gtgtccattc cccactcctt cgacaaatgt ccacacccag ccttgctttg acaccccaag 9360 

aacagagatg gtgacacctg cttcctacat gcccattgct ctcccaaggc agacatcccc 9420 

agcagatgca acacagtgtt taggcagaca tcaccaatcg atggtggcaa cagacaccag 9480 

gccctgctcc ctctaactcc agtggccagg ccccaagcca gctctcacct gcccactccc 9540 

aacccacagc agcaagactc agaaatggca aaaacacaaa gagaacagaa acgccccata 9600 

gcgggaggat gactaaaaga catgtcttga taagatattg ttcaggcata ggccaggcac 9660 

agtggctcat gcctgtgatc ctagaacttt aggaggctga ggtaggtgga tcacctgagg 9720 

ttaggagttc aagaccagcc tagccaacat ggtgaaaccc catctctact aaacatacaa 9780 

aaattagcca gacatagtag cgggcgcctg taatcccagc tgcttgggag gctgaggcag 9840 

gagaattgct tgaacctggg aggtggaagc tgctgtgagc cactgtactc caacctggac 9900 

aacagagcaa gactctgtct caaaaaaaaa aaaaaaaaaa gatatccttc actaaaactc 9960 

atgtctttga tacatattta cctcctgcaa tcgcaaatgc ttctgcagtg cataaagtga 10020 

aataaatagc aggaagcctt acggttcgat cacccacaca gacacacagt cacatacagg 10080 

aaaaacgcag ggagggctgg ggaacaaaaa aacagaagat aaaatgtgga gacagacaca 10140 

ccaagagagt aagagaccac ctccagacct cccttcagct tctcaaacac acgagccggg 10200

cccgttacag aatttgcggg gaccgctgca aaatggaagt gcagacagcc ccttactcaa 10260

aaggtaggaa tttcaggtca acaacagagc tcacctcata tgactacaca ggtcacacag 10320 

cccgtgaagt cggtcccaac accagcatgc tcctgcctca aagccgctgc acgtgctgtt 10380

ccttctcgcc tttccctctt ttagtccttc agatctcagg cctcctgaga gagacctctg 10440 

acctgccggc tcaggcggcc acacccccag tacaggagtc tccggctcag cccctgctgt 10500 

gttccgtacc cgatccaggt ctgtcctatg tccatctgtg tgccggcttg cttcctgaca 10560 

tggcccccac cacacgtgtg cctcggggca ggggaacagg cccgtctcat taactgcttt 10620

cttctcagat attttctgga atatttgtgg atattgggca acatatatgc tccacctttt 10680 

tcagactagc caggacgagc tgcatttttt tttttttttt tttgagacag ggtctcactc 10740 

tgttgcccag gctggagtat agcggcatga tcttggctca gtgcaacctc cgcctcctag 10800 

gctcaagcaa ttctcctgcc tcagtctccc aagtagctgg gattacaggc ccgtgccact 10860 

actgcccagc taatttttat atttttagta gagatggagt ttcaccatgt tggccaggct 10920

ggtcttgaac tcctgacctc aaatgatcca cctgccttgg actcccaaat tgttgggatt 10980 

acaggcgtga gccactgcgc ccggcccgag ctgcctgttt tacacctttg ccatattccg 11040 

gtgattctct ctcccctccg tcccccggcc ctgactgtgg tggccactcc ctgccgtcat 11100 

gagcccgtat gtcctcactc tttccctttc cgccaggact tcaaccaaca ctgcagagcg 11160 

cagggtccag ctccagcact gagttcagcc tcttctcacc aacagacagg caggaaagaa 11220

aacaaactct gagaaggcca aggttcccgg gcagccagca agccaagcat ccttctccgc 11280

tgaggcttgt gcagccgagg caccccctcc tccagggagc aggcagcgtc ctggggcagt 11340 

ctgcgaggga gaccagggcc cttgctccac cagggcccca ggtatggggg cagcagcaaa 11400 

ctcatggctc tgggagccag accccacctg ctagaaccta ctatgccacc tgctgtgggc 11460 

aaccccaggc tggtgacttg ccctggcctc ctctgtaaac aaagggctca tccaacctgg 11520

tcaaaccact cctccccttc aagggtctat aatcctccct taacctgctt ggtccaaacc 11580 

cctggtgtcg ccaggtcact caggaggcag ctcatctgga ctccttccct gggtccagtt 11640 

tctctctcaa cattgccttt gaggccgagg tgaacggtca acagcgaagg gccccagagg 11700 

tgatggagga gcgggtgtcc aagacactca ccctttctaa tgcactgact ccctcgtgga 11760 

ctcacttgtg ccgtctcccc cacccaccca gccccagagc ccagagtgcg agcgccagag 11820 

gcccgggatt ctgtctgcac cgcggggtcc ccagtgcctc ggagcaatgc cagcacccgg 11880 

caagtgttcg acaaatgcct gctgaatgag caaatggatg gatgaacgaa tgaatgagca 11940

agcagatgaa tgaatggggt gctgtccaga gccgtgagga ctaggccgcc caagtcccca 12000

tttctcaaat tctccttctc ccgacttggg aaacaagatg cttggtcggg gaggctctcc 12060

aaccatcccc tgcagcagcc ggcacagcgg acagaccctt tgatgtaaca gccatgtctt 12120 

cattaaagat gccctgctct cagaaagaga aagacaaata caaacctgga aaatcctcac 12180 

caaacgcagg acccctgcca gggagcagag aaaagaccca cacgccacgg gcgccacgac 12240 

cacacacaca ccccagccgc tgcacacaaa cacagaccct agccagcaag aacaggggga 12300

ccaggaaact gttcctaaag tcaggacccc catgtgctca gacagcagtg agagcaagga 12360

cacttctcca tccaccggat gccaggagag tccttttagg gggccccaca ccgagactct 12420 

gcccttagga ctgttcctga gtgtggaagc cagcccactt ggaagccccc tgccctcccg 12480 

agtgggacac cggcacagga agcaggccct gtcccccacc actttctgca agctgggccc 12540 

catcacgcta cagaaacggg gaggactggt cccagggatg gcgctttcct gacacctctc 12600 

gttaccccct cgcttgccag gccccagggt cagccccaga ggccagactg gctatcccag 12660 

gcccgggagc atccccgaag gcgagctgca tcctgaacgt gtgtgatttc ccgaagggcc 12720

cgccccgaac cgacacctgg aaagaaagat cctcagccgg tgccccagag gagaagagcc 12780

atgcctcact gcaacacagt cccaggaagc accaagtgcc tgaggaccaa ggcggagagt 12840

aaaaaagtgg aaaatatctg gggcaaaaat aaaacaaaac aaaacaggat tgacctcctg 12900 

ggctcaagca atcctcccaa ctcagcttcc cgagtagctg ggaccacaga cttgaatcac 12960 

cacacccgcc aagtggatca tttcgaacgg gtttgccgag gttccttctg gggcaccccc 13020

ggcggccgca acccattccc gccaggcccc gccccgcccg cccgccccgt cccgtcccac 13080 

cgcctcacct gccttacacg tcctgccgtt gtcctgcagc tgcacacccg tggggcaggc 13140 

gcatgtgtag aaaggctcgc ttggggacag caggcacagg tgggagcagc cgccattgtc 13200

ctcctcacag cgagtgtgga ctgagaaaac caggacagac tgagagaagg ttccagaaga 13260

ggaccgtcac ttgtttctga atgagtcaca tcctgcctcg tcccccgtga cagcctccag 13320 

tgtgtccctc tgcccaaaca tcggcctcaa gtggcatcag ggacctcccc gcgggcacca 13380 

ttccacctgc ctcatcgctg gccccgtcca catggggccc tcagcctggc cagacggcct 13440 

gcaatttccc caaaaccagc cgtgaccttc ctggccaccc tcacacccag atgtgacctg 13500 

cccatggagt gacatcctcc ccatctgctt cctcccacca agctcctatg actagaacac 13560 

cctccccagc tcctcggagc ccccaaagga cacccctctg caaaggctgc cccccacgct 13620

ccaatggccg gggtcaggac ctgcctgtgt ggtagtgacg ggaaccccag agacaatggg 13680 

ctcctgggca aaaggcttgt cttgtctttg tgctatgtgt ggacccagca gcttccatag 13740 

gaacactgtc cttcttgctg ggatggccaa gcttgtcact ctcccaagcc ctcctatgac 13800 

caacagcaat tgaacggaac tcgataaatg cttccagcac ctcattcaaa ccaggggaaa 13860 

gctgggtgta gcagccccaa aatacggata taactggaac aacaaactca tcaaaatgaa 13920

cctctccctc cctcatgctg ccccaagtgt agatgggttt tgtgaccacg actttctcac 13980 

caggaaacag ctccagagag ccccaccctc ctgtgtcctg ctctgggaac agctggcacc 14040 

cctaggcccc acatttcaat tcaaagtcca aaccttccat aatggcctgg ccagaaatct 14100 

ccatccctgg tccctgtggg agtgggccac tgtccccaga gccgcagccc cactgtcaca 14160 

gaagctggtg catttcccca tcagggacct ctgtcacaac ccagcgtggc ccccaggctg 14220 

agaactgctg attctgggca gattattcat tgataaatac gcgacttgca gggccaagca 14280

tggtggctca tacctgtgac cccagcactt tgggaagtca aggtgtgagg atcactggag 14340 

cccacgagtt tgagacaagc ctgggcaacg tggcaaaatc tctcatctct attaaaaata 14400 

catacacaca cacacacaca cacacacaca cacatatata tgtatatata aataaccata 14460 

tatatatata cacacatacg tgtatgtgta tataaataca tatacacaca cacacagaca 14520 

acttcttctg ggccttgaaa acgaggcaac cttccttgga aatccccttg ccactgctga 14580

gcctgaaata gcccccatga gctctgcaga ggggtcctct gcaggcccgt gtcccccagc 14640 

cagccacaca cctccctcca ttgcagcagg taccccttta gagagggggc cccccagagc 14700 

atgggcttct gcagggaggg gtcacctgcc cccccacccc acccacgccc gcgcaccccc 14760 

acgcccccgc atcctcccac tcccctgccc cgcgcccccg ctccccccag ccccctcacc 14820 

ctctcccccg tgccccaacc ggcactcaca aaaaggctgc cgctcctggc tcagcacctg 14880 

gatgtccatg ggtgagtata gggcactcag gatctccttc ctcttccccc cagtgcgctt 14940 

gttgcaggca tggatggagc gggtctgcca gtctgtccag tacagagtgt ccccggagag 15000 

cgtcagggcg aaggggtgcg tcaggctgcc ctccaccacc ttctgcctgc agtcagggaa 15060 

gcggggtgga ggagccatca ggagggtccc ccgacagtca ttgctgctga cccaattaat 15120

ttcttttttt ttttttgaga tggagtctcg gtctgtcgcc caggctggag tgcagtgatg 15180 

taatctcagc tcactgcaac ctccgcctcc cgggttcaag caattatcct gcctcagcct 15240

cccgagtagc tgggatcact gatgcccacc actacgccca gatgattttt gtatttttag 15300 

tagagacagg gtttcatcat gttggcaagg ctggtctcga actcctgacc tcaggtgatc 15360

cacccacctc agcctctcaa agcgctggga ttacaggcgt gcgccaccat gccaggcttc 15420 

ccatttgctt tcaaccagac aagtgaggcc aggtcaagag ccccaggagc tggcgccctc 15480 

gtacatttct cccggcgtgc acagggcacc tcccaaacac agcctgtgat ggtgacacac 15540

gggctccccc aggtcaagtg gcaaagtctc ccccagggaa gaaaggagga agccatgcct 15600

ggcaaaaagc acacctctcc tgcccaacgc tttaacctct gtatacaaat caggccatgt 15660

gcactcgctc cttcttacaa tgctcataat ttatactttc agagtaaatg aaacttggca 15720

tcaacccgag aaacagctat tcttttctag atgcttacag tgcccagcaa atgaggactc 15780

gggtgtaatg agattatgga cactggaaac aggatcataa tgtgacgtgg tcggtaatgt 15840

gcagttttat ttgcttaatg accctcgccc cgtgacaggc tccctgaggg tgggcctggg 15900

ggcagaggtc cccgccacgt ccccagccct cagcacagtt gccaggagag ggtgacactc 15960

atgaagtggc acagggaaga tgggagctgt gggctctgca gatccaccac ctcttctgtt 16020

catttttgtt gatgctgttt tttaagaaaa ttattgaagt aaaattcaca ggacatacgt 16080

ttactttttt tttttttttt ggagatgggg tctcactctg tcacccaggt tggagtgcag 16140

tggtgtgatc tcagctcact gcaacctctg cctcccaggt tcaagcgatt ctcccacctc 16200

cgcctccaga gtagctggga ccacaggcgt gcaccaccac acccagctaa tttttggggg 16260

gtatcttttt ggtagagaca gggtttcgcc atgttgccca aggctggtct tgaagccctg 16320

agctcaggcg atccacccgc cttggcctct caaagtgctg ggattacagg cataagccac 16380

tgcacccagc ctaaatttac cactttaaag tgaatagtgt tacctagtgc attcgcaagg 16440

cggtgcagcc tccacttctg tctagttcca aagcacttcc attgccccac aggcaaaccc 16500

cacacccggc agcagtcatg ccccagtccc cgcccccagc cccggcaaac acttttgatg 16560

gacttaacta cacacattct caacatctca tataaacgga atcacaatat acagcctctg 16620

atgtctgtct tctttgactt ggcaccatgt tttcgaggtt catccaggct gtagcatgtc 16680

agtgcttcat cccgttttag gggtgaacca tattccagtg tgcagacaga aaccaatctg 16740

tgcatccatt cacccactgg gggacctttg tgtcatttcc accctcggct gttgtgcaca 16800

gtgctgctac ggacattact gtccattcac attttgtgtg aagacctgtt ttcgattctt 16860

aagagtatac agctaggagc ggaattgctg ggtcatacgt aaatcaatgt ttacgtctca 16920

aggaatcaac aaactgtttt ccacaatgtt gtcttttttg tttgttttct gagacagggt 16980

cttgctctgt cacccaggct ggagtgcggt ggtgtgatca tggctcactg cagcctcaat 17040

ctcctaagct caatccatcc tcctgcctca gcctcctgag tagctgggaa cacaggtatg 17100

taccaccatg gccagctaat tttctaattt tatttttttt tgtttttgtt tttttgagac 17160

agagtctcgc tctgtcgccc aggctggagt gcagtggtgc catctcagct cactgcaagc 17220

tctgcctccc gggttcacac cattctcctg cctcagcctc ccgagtggct gggactatag 17280

tcaccggcca ccacgcctgg ctaatttttt tgtattttta gtagagatgg ggtttcaccg 17340

tgttacccag gatggtctcg atctcctaac ttcatgatcc acctgccttg gcctcccaaa 17400

gttctgggat tacaggcgtg agccaccacg cccgacctta cttttaattt tttaatttta 17460

ttattttatt ttattttttt tttttttgag acagagtctc gctctgtagc ccaggctgga 17520

gtgcagtggc gggatctcag ctcactgcaa gctccacctc ccaggttcac gccattctcc 17580

tgcctcagcc tcccgagtag ctgggactac aggtgcccac cacgatgccc ggctaatttt 17640

ttgtattttt agtagagaca gggtttcact gtgttagcca ggatgatctc aatctcctga 17700

cctcgtgatc cgcccgtctc agcctcccaa agtgctggga ttacaggcgt gagccaccgc 17760

gcccagcctt tttttttttt tttttttttt ttttgagata gagtcttgct ctgtcgccca 17820

ggctggagtg cagtggcggg atctcagctc actgcaagct ccgcctccca ggttcacgcc 17880

attctcctgc ctcagcctcc cgagtagctg ggactacagg cacccaccac cacacctggc 17940

taatgttttg tatttttagt agagacgagg tttcaccgtg ttagccagga tggtctcgat 18000

ctcctgacct cgtaatccgc ccgcctcggc ctcccaaagt gctgggatta cacgcgtaag 18060

ccatggcgcc cagcccatgt ggccattttt cagtgagaga agccagaggc ccatcactct 18120

cggttgctcc ctgggccatg ctctgcctca gccagaagca ctgagggaag gtcagcctcg 18180

gcccttgccc cagccacagt cacagataaa ggggcctgca caggtctgtg tggctccaga 18240

gctcgtcacc caacacacga cgcttccatg tgaatagccc caggtgcatc atgaagagcg 18300

atggccgctg cagaggcaga agaatcccgc ggggaagcag gtgggagaga ggctgagaac 18360

agaccagacc ctggagctac agaccctatg ttccaaccct ggctgggact agctgtgtgg 18420

ctctgggcaa attcacatgc ttctctgtgc acaggggatc aaaatagcaa acacaggcta 18480

ggcacagtgg ttcacaccta taatcccagt gctttgagag gccgaggtgg acacatggct 18540

taagctcagg agtttgagac cagcctgggc aacatggtga aacctcgtct ctacaaaaaa 18600

aataccaaat aaattagcca ggcgtggtgg tacgtgcctg tggtctcagc tacttggaag 18660

gctgaggcgg gaggaacact tgagcccaag aagtcaaggc tgtggccgcg tgtggtggct 18720

cacgcctgta atcccagcac tttgagaggc tcaggtgggt ggatcacttg tgatcaggag 18780

ttcaagacca gcctggccaa catggtgaaa ccccgtccct actaaaaaaa tacaacaatt 18840

tgccaggcgt ggtggcgggc acctgtaatc ccagctactt gggaggctga ggcaggagaa 18900 

tagttagaac ttgggaggtg gaggttgtag ttagccaaga tggtgccgct gcactccagc 18960 

cagggggaca gagcaagact ccatcccaaa aaaaaaaaaa acaaacaaac aaacaaaaaa 19020 

agaggtcaag gctgcagtga accatgattg tgccaatgca ctccagcctg ggtgacaaag 19080 

tgagaccctg cctcaaaaca ataaaaatat aaataaaaat aaaacataat agcaaacgtt 19140 

tcatagaggt ggtatgagca ttaaatgaac tgataaacgt ccctggaaaa cagtaagtgc 19200

tatggaagga ttcgctgccg ccaccgccac caccattagc atgtttcaac ctccatcacc 19260 

ctcactgtcc cctgtcacca tcctttgacc agggcactcc cagctgcagc ctttctatcc 19320 

tcttgtccac ccttcataac tgtaagatca ctcagctccc aagaaccaca gtctacaggg 19380

taaccacatt tccaaatctc aaaccagacc cgctggtctg cacttccagg gacaacagga 19440 

tattttcaaa ccagcccaaa agagatgtgt ggctcagcat aagaggaaca ggagaaactg 19500 

aggcctcttg ccctgagaat gagcttggaa gtggatgtcc cggcctcact caaaccttca 19560 

gatgactgag gcccagccag gagcttgagt gtaccctcag gtcataccct gagccagaag 19620 

cacccagcta atccactcct catcactgac tccctcccca taaaaaacct gtttgctgtt 19680 

tcaggctgtt aagttgtggg ctgttttgtt acacagcaat ggataactaa cacacgaggc 19740 

ctggcaagtg tggagcaaag ctgcccaagc cctcaagtct gttcatgtgg gtgttggcct 19800 

gtgtttgcag aaatccagcc actgagtcct cccatgcagt cactactgcc ctctgcacag 19860 

acacctgcca catccctgcc tgggccagga gctccactag tgcaggaatg gggtctgccg 19920

tcccaggagg atccctgaca cctagcacag ggctagcagc aggcagcact tggttagtga 19980 

ataaactgcc cttcacctgt acacagaagg gatgtttcta taaggggtaa ttaagtacag 20040

agctgggaag ctatgctgac cagaaggctc taaaagcaat taaccaacga ggggaaaacc 20100

cttcctactc attctcggcc cattttattg agcactgacc atgtggaagg ccccctggtg 20160 

agactgggga atgcaccaat aactgagaca gcttccggct gttgccctca ggatgcctga 20220 

gctgggatag ggccagggtg ggggtggtgc gtgtgacagg gttactgttc acaaccctgc 20280

cgggccataa gccctcccca acaattccaa aatccaaaac gctctgaaga tggaaagctt 20340 

ttgttgctca tctggtgaca aaacctcatt tggtgcatgg gccgggtgcg gtggctcacg 20400

cctgtaatcc cagcactctg ggagccgagg ggaaggatcc cttgagctta ggagtttgag 20460 

accagcctga gcaacatgtg agaccccgtc tctaccaaaa atacaaaaat tagccaggtg 20520 

tggtggcgca ctcctgtagt cccagctact cgggaggctg aggcgggagg atcgcttgag 20580 

cctgggaggt gggggctgca gtgagctgag attatgacat tgcactccag cctgggtgaa 20640

agagtgagac tctgtctcaa aaaaacaaag ttaaaaaaaa aaaaactgtg catgggtgtg 20700

ggctacagat agtcttttct gccctactta gaatgaacgt gccacatttg ctatagaaat 20760 

attcaagggc tggtggcaaa tgccacacag accctgacgc tgttccaagt tctgagaagt 20820 

cctgcattcc tcagggcccc agagtttcag agaagagtct gtaggcctga gttaagaagg 20880

aacgccttca aaagccctgg ggacaaaggg gaaaggggtg ccccaggact gcgtgggtac 20940

ctaccggaac gagccgtcca ggttggcacg gtggatgaag ctgagcttgg cgtcagccca 21000 

gtagagcttc tgctcctcca ggtcgatggt cagtccattg ggccagtaaa tgtccgagtc 21060 

cacaatgatc ttccgggtgc tgccatccat ccctgcccgc tcaatccggg gcgtctcacc 21120

ccagtctgtc cagtacatgt acctgtgacg ggggcagggc aagagaagca gctaacacag 21180 

atctgttttt tgtttttgtc tgcatagatg cagacatgaa acaacagaca gtgaacttgc 21240 

cctaaaatct cacccatcgg aaataaccaa caggtatggt ttcaggtatt cctgccttaa 21300 

gctgggcaat caaaatatac tatttccaac ttgttctcag ttaacagtaa attctgggca 21360 

ccttcccttc ttgtggatag aaagattcct tgttcttttg atgattgcct agtgtactct 21420 

gctgtaagtt ttttaaagaa cttcaggtta tttctgattt ttttgctacc atgaaaatgc 21480 

tgtaaatgaa cctctaaaag gcaattcaaa acactcagga tggaatatta tttagtggta 21540 

taaagaaatg agctatcggc tgggcccagt ggctcacacc tctaatccca gcactttggg 21600 

aggccaaggc gggtggatca cgaggtcggg agatcaagac catcctggct aacacagtga 21660 

aaccccgtcc ctactaaaaa tacaaaacat tagccaggcg tggtagtgag cacctgtagt 21720 

cccagctact taggaggctg aggcaggaga atcatttgaa cccgggaggg ggaggttgca 21780 

gtgagcagaa atcgcaccat tgcactccat cctgggcgac agagcgagac tccatctcaa 21840 

aaaaaaaaaa aagaaaagaa aagaaatgat ctatcaagcc atgaaaagac atggaggaaa 21900 

cttaaatgca tgttagtagg tgaaagagcc aatctgtatg agtccagttc taaacactct 21960 

ggaaaaagca aatacacaga gacagtaaag catcagtggt tgccaggagt tggagaggag 22020

agggatgaat gagtggagca cagaaaatca gggcagtgga actatcctgt atgacatgga 22080

atggtgggtg catgtcctta ctcatctgtc taaaccaaga atgtacaaat caagggcgaa 22140

ccctcgtgta aacgtggatt ttgggtgatg gtgcgtcagc cagctttcat cagttgtaac 22200 

aaatgtacca ccctgcacag gatgctgaca gttgggaagg ctgtgtgggt gtgaggacag 22260 

ggatgtatag gaactcagta cctgctgctc atcaattttg ctgtgaacct acaactgttt 22320 

gaaaaaatta agtctattta aaaacaacaa aacatggcca ggcacgatgg cttgcacctg 22380 

taattccagt acttcgggag gctgaggtgg gtgggtcact tgagccaccc tgggcaacat 22440

ggcaaaatcc cacctctaca aaaaataaaa attaaaaaaa agttagctgg gcatggtggc 22500 

acactcttgt agtcccagct acttgggagg ctgacgtggg aggatccctt cagccctggg 22560 

aggtcgaggc tgcagtgagc tgtgactgta ccactgcact ccagcctgga tgacagagtg 22620 

agaccctgcc taaaaaaaaa aaaaaaaagg ctgggtgcgg tggctcatgc ctgtaattcc 22680 

agcgctttgg gaggccgaga tgggcggatc acgaggtcag gagatcgaga ccatcctggc 22740 

taacacggtg aaaccccgtc tctactaaaa gtacaaaaaa aaaaattagc cgggcatggt 22800

ggcggacacc tgtagtcaca gctactcggg aggctgaggc aggagaatgg cgtgaacccg 22860 

ggaggcggag cttgcagtga gccaagatca caccactgca ctctcagcct gggagacagc 22920

aacactccgt ctcaaaaaaa aaagaataaa acccatggct gggatggacc ctgaacctgc 22980 

agctgcagct gttcctgggt aggtctgtgg gcgacgtggc tttgcttctc catgttccca 23040 

agagacaagc atcacccatc catgagaaac aagcacatcc tcagggcgcc cttacgtgat 23100 

ctctggccaa tgaaccaaga caaagtgagc agacaccagg tctgggatgg caggtcccac 23160 

ccccaccagt gcccagtgtg ccctgtttgg aggtgaccac agggtgtgtg cccagaggct 23220 

gggcgtgact ctcagcggag accagagggg aaccacacca gcttggagga ctcagttccc 23280 

atcccagcca gctgggatga gccacaggac acaagggctg gcagacctat tgtgttttgt 23340

ccacccttca cagcagagaa aggggacagt gcccagaatg tcctctgagg agcctcctcc 23400 

cactcttggt ccttgtaaaa tggtgctgac tcccttgctc ccttcttcct ggggtgggcg 23460 

gcaaacccca ttcccctcag ccttagcaag tgatttagaa acaggcagct cgcccaagcc 23520

aggcatgaga gtgatcccgg gacacaggga gaacaagccc cgctttgccc tctgggggtc 23580 

tccattcagc agaagaggca aatgacagac acacagccgc ctcctccccc accatggtgc 23640 

tctgcagcct caggagcctc aggtgcacca agggccaccc catccagggg gccatgcttc 23700

cttgagtggt atcgttcctg agcgagtacc atctccacct tccagagggg ctgtgacaag 23760

atcaacaaga atgagggcat aggagcctcg aaccaaacat gccctcttcc ctgcagaggc 23820

tgactgcgcc cagctgctat caccaagccc ctgctcctcc ggccccgtgg ggacagggta 23880 

agaggggtgt cacatggaac agctctccaa acagtccctc tcaagctgct gtctcctgtg 23940

catctagtga gaacccaacc aacaaaggga aggtgggaat tgctattccc attaggcaga 24000 

tgagaaaact gaggccccga aaggctggcc tgttccaggt tacaggcgct gagcggctgc 24060

tctgggaaca cacttggtgt ctgctgaggg cccgagcccg gccatcatat gactcaccct 24120

tcgccagcaa agcccgggtg tgggtgaact tttcctggca gcctgggact ccaaggtgct 24180 

ggcagccagc ccagggaagg ctcccgcgtg cctgcggcag acgccttgct ttacctgcac 24240

gtccccaccc ctaggagcct ggacagagcc cagaccctcc gccacctcct gagaaggtat 24300 

caggggcatc agtctggact tgggggggaa tccacacagg ccttccccaa atgctccacc 24360 

gtggcccatg gaaaaggctg gaaaacgtgc aggagcagga gcctccgcat ggagcataat 24420 

tcacattcct tccccgagtt tcataacaga ggcctgctgg tttccttaaa tggggaattt 24480 

gcgagccagt cggtgaccag agactggttg gcgtggacgt gctcttgcag agtctcaaac 24540

gctaccacaa gcccagccaa attccacgga ggaaaatcga cttccgaaga aaagagctgc 24600 

agcatggcct tcgtgcagag ccagctgcgg ttgtggttgt gtgttatttt agggaagggc 24660

cattttgcat tttaaagagg gggttgggtt tcaccctggc tttaatttga gacccggggg 24720 

ccactgcagc cccttgtcag gctggtacag gccggggact cctcccatgc taagccagtg 24780 

tctttctggc cccagatcct caggggccag agggtcatcc ccagagcccg ctctgccacc 24840 

cacatgggta ccctgggcct gggagggatg tgccttccct caaccctgcc tggatgtccg 24900 

cacggggcca cctgcattgc tgaaactgca acgaagtcga gtctcaggag gggcccccct 24960 

ggctgcaggg ctcttgatcc ttttggccac gtgcacactg aggtggacgc tcggacccag 25020 

agaccccctt catgatgatg gccggggcag gaaccccctc ctctgaggaa ggaccctggt 25080 

gggggacagc actgcaggag ggcacaggag atgacggggg ctctagcagg gccgggagga 25140

aggccaagat gctcctcgca accgtgtgcc tgtggccagg acagaggaca aacccaccct 25200

ccactgtccc cactctcagg acagcagtcc tgccccagga ctcagcgccc acacttatgc 25260

ctgaggacca ctattcaagt cagtatttgg cgagcagggg ttgctgccgc gggcgctgtg 25320

acaggctgga atcctctccc tctccctctc cctctccgga gacatggagc ctacagggac 25380

agagtcagca cctcagggta ggaccatggc tggcgtcatc agcatcactg gatctgatga 25440

gtgggagccg gcatctcact gttttcactc tctcattcaa atgactggag caaagggaag 25500 

gtgtggggag aggcccagga atcaacacta aggtcaactt tgcccccagg ggcaggggtg 25560 

ggagtgaaca gccacaggtg tgatcctggg gagggcttct gggagagaat tcagaggcaa 25620 

gcatgtagag gaaccatttc aaatagttaa gaaaagccag agccaaacag ggacagttgg 25680 

ctcgcagaga tgatgcaggc aaagccagct cagatctgag catgggaaag actactccca 25740 

accaagggcc cagcatctcc caaccaagca ccaagtacct cccaaccaaa tgccaagcac 25800

ctcccaatca aatacctccc aaccaagcac ctagcacctc tcaactggac accaactact 25860 

cccaaccagg caccaagtac ctcccaacca agtgccaagc acctcccaac caagtaccaa 25920

ttacctccca accaagcgcc tagcacctcc caactgagca tcatgcacct cccaacagag 25980 

catctagcac ctcccaactg atcacctccc aacctagcac cgagcacctc ccaaccaagt 26040 

gcagagcacc tcccaaccaa gtgccaagca cctcccaatc aaatacctcc caaccaagca 26100 

cctagcacct ctcaactgga caccaacaac tcccaaccaa gcgccaagca cctcctaaca 26160 

aagtaccaat caccttccaa ccgagcacct agcacctccc aactgagcat catgcacctc 26220 

ccaacaaatc acctagcacc tcccgactga tcacctccca acctagcact gagcacttcc 26280 

caaccaacat agcaaaagcc ataaagaagt aaaaagacaa aaccacgtag gcatggagac 26340 

tggacttctg gtggcgagga aagggcattt ttattataac gacagctaac atttgttgaa 26400

ctcacaaact gttcttggtg ttttcctcat gacatgcagc atggtcacgc ctctgtacag 26460 

acaaggatac tgaggcacag agtggcaccg tgccaacctt gtctcatctt tttatcgaac 26520

ctacatgcag agtgccagca aatccagctg tcttttctct tcagaacaga tcccaaatct 26580 

cgccactcct tacccccaca agtgaggtgt ccccgctgct gctttctgtc gccaggatcc 26640 

cggtaataac cgtggagagg gctcctgccc ccacgccacc caccccacag ctcactctcg 26700 

ctccagccac caggggatgc cttccagcac gagtcagagc tggcacctcc tctgctcgag 26760 

acctcatgtg tcctctcctc acaccttggg ccctgtttcc ctacattctg ctacagcccc 26820 

tcaaacaggc cccgccccaa accagcccag ggcctttgca ctggctgatc cctctgcctg 26880 

gaccgcgctg cccccagaca gccacacggt tctcagcctc atctgcttcc agtctcgact 26940 

caaaagtcac caagaggcct tcccagcacc tgagctccga cggaagcccc tcgccacagc 27000 

acccaagcac tgctttatcc ccctacgcac acgtcccttt caaatactat tcatttacca 27060 

tctcctccca ctcactgaaa gggccagaga ctgggctata cccgctgcgt ggggagcagg 27120

accaggcgca agggctcaca aatgcagtgg atgcctggtt gggaggtgag ggagctgcag 27180

cgacccacgc tgggagggaa cgcaatgaca ggaggagcgc aggtcctggc gacacgatgg 27240

ccatggcagc cgctggtgag caaccgcagg ccggccctgg gagagggctt ctagcaagct 27300 

gctatcttca gcctctccga ctactgcaga tgccccctcc tagccagaga cactgctaca 27360 

ccagccgacc cttccaaaaa gaaggtcagt aaccccgcga ctcctggagc cacagtgcag 27420 

ggggagaggg ctgagagggc aacagttcac caagcggaac agaggctgcc ccggaggtca 27480 

gctggctccc cggcagctgc aggggtggct agcccactcg gagggcagcg agggcatacg 27540 

aggggctcca gggatgagtg gttgcccagc acagcacccc tgggaggccg ggggcacttc 27600

tcaggtagtg ggggcacgag gctgctctgg cctgacctca gggactcaaa atactttggc 27660 

gataaattcc accgtgtccc acccctgctg gtaccccata cttacacaca gactggttca 27720 

gatgcagaca ctctcgcgca catactcgct cacacgggca catacacgtg cacacacagt 27780 

cacatgcgca cactcataca cacacaaata tccactcaca cgcatgcatg cacacacacg 27840 

gacacacaca ggctcacacg tatgcacgca tatgcgtgca cacgcacaca cacacacaca 27900 

cgctcacatc ctcccactcc cacactcagt tgctcagaca cacacacgcc tggctctcac 27960 

acaaacctgt tgggctctga aaggctccag cccttcccat gctcgtcaga agccagtcaa 28020

tggcttccta agtcaccaca cagatcaaag aggtgaactt ggccacatgg cactctgctt 28080

cctgagctcc caaacaccag ccttggtgag gacagaccct caccccacac cctcattccc 28140 

actaccctgg gcaggcccag aggaggggca tctgcaggat ctggcaacca gcccctcccg 28200

cccggctcct gcagccggca ccatgggagt cagggggagg tcactgcaaa gggcaacagc 28260 

aagttggtgg ccccaggact agagcccagg ggtcttcagt cctactccag agcttggaca 28320 

ctgtcccaca gggcatggcc aagggaaggg cttccagagc cctgacttca gggaggaggg 28380 

caggcgggct cctgtggcag gcctggatgc atggccgccc actcctggga ctttctaacc 28440

tagaatatct aggtcaggct gggtgcagtg gctcacgcct gcaatcccaa cactttggga 28500

ggccgaggag ggtggatcac ttgaggttag gagtttgaga ccagcctggc caacatggcg 28560

aaaccctgtg tctactaaaa atacaaaacc tagccaggtg tggtagtgca cgcctgtaat 28620 

cacagctact caggaggctg aggcaggaga atcacttgaa ctcgggaggt ggaggttgca 28680 

gtgagctgag atcgtgccat tgcgcaaaga agatctaggc cggcccctca accggtgagg 28740

tccaggctgg gagtgctgag agactgtggt gacactgaat gaactaacag gcaaagggct 28800 

tccaactgag cctgggggtg gtgggaaatg gctcttgtgt tctagtcaag acctctgcca 28860 

accagttctg acactgaccc agcacagaac ctgacaggtc agcaagggcc agggcttagc 28920 

acagcccagg taagggtgtg tgtacggccc ccagagtcac tcccaggctg caagaaaagg 28980 

gacaaaggag ggacaagggg tggccaagca aactgttccc tctgctcggg agtctggggt 29040 

gacctggcct agctggccag tggagctggg ccacctcccc ttaaactctc caccccggac 29100

ttcgactcca aagctttcct gccacccacg ctctccccac ctgggatcac ggccaggccc 29160

tgagccttca agggcccagg tgaactcagc cagactagga gctgaggagg acacagggca 29220

gcttccagaa cggacccgag aaccactccc agcaggttct gcttccagac aaggagctgc 29280 

actttttcag ccaatgcaat tagaaagcca ggagaaggtg caaattccac ctgcctgagc 29340 

gtccgcactt cccaggccgc ccaccataca cacagcaaag atgtgtttaa ccattcaaac 29400 

ccatggccaa ccacatcggt tgcctcagac atgcaagttt taaaaaggaa cataactatg 29460

ggccaggcac ggtggttcac gtctgtaatc ccagcacttt gggaggccga ggtgggtgga 29520

tcacctgagg tcaggagttc gagaccagcc tagacaccat ggtgaaaccc catctgtacc 29580 

aaaactacaa aaattagctg ggcgtggtgg tgggcgcctg taatcccagc tacttgggaa 29640 

gctgaggcag gagaatcact tgaacccggg aggcgaaggt tgcagtgagc cgagattgtg 29700 

ccactgcact ccagcctggg caacaaggga gactccatct caattaaaaa aaaaaaaaaa 29760

aaaaaggaac ataactatgg agtctcaagg ggaagtaatt ccttcaacaa taacaaatct 29820 

tgaaagctga gctctttttt ttttttgaga caggatctcc tcactttgtc gcccaggctg 29880 

gagtgcagtg gtgggatcac agctcactgc agcctcgatc tcccaggctc aaatgatcct 29940 

cctacctcag cctcccaaga agctgggatt acaggtgcat accatcacac ccgattcatt 30000

tttgtatact ttgaagagat ggggtctcac catgttgccc agtgtggtct tgaattcctg 30060 

gactcaggtg atctgcccgc cttggcctcc cagagtgctg ggattacagg cctgagccaa 30120

cacccccacg ggttcatttt cagagtcgca ccgagtgctg gggttacagg cctgagccaa 30180

cccccccacg ggttcatttt aagagtgaca ccgagtgctg gggttacagg cctgaaccaa 30240

cccccccacg agttcatttt cagagtcgca ccgagtgctg gggttacagg cctgagccaa 30300

cccccccacg ggttcatttt aagagtgaca ccgagtgctg gggttacagg cctgagccaa 30360 

cacccccacg ggttcatttt cagagtcaca ccgagtgctg gggttacagg cctgagccaa 30420

cccccccacg ggttcatttt cagagtcaca ccctttttct gaaaaacaac ttgggctcat 30480

gcaaattcga gagagagatg gtgacactcc ccgccccctg gacccaggtg gagtcgcagc 30540 

agggtttacc cgtgagcggg gtccaaggcg atggccctcg gctggtcaag gtcctgccag 30600 

aagagcacct tccgggatgt gccattgagg ttggccacct cgatgcggtt ggtctctgag 30660 

tccgtccagt acagcttctt gcccacccag tcgcaggcga ggccgtcggg agagaccagg 30720

ccggagatga ccacgttctg cacggcggcc cccgtctggt tcaggtaggt ctgcttgatg 30780 

gcctcctcgc tcacgtctgt ccagtacacg gctcccttgg aaaactggaa gtccactgcg 30840

gccgcatcct ccaggccgct gaccacgatg gtggactcca gcttgactcc gccggcgtcc 30900 

accagccgta cgtcccggcg gttggcaaat agcaggagcg gcgaggctgt ggggcagaag 30960 

caaaccgtga gggccactgg ctaagccagc aagatacaca gccctgggat ggagcactat 31020

gcccagagca ctcctggtac tgccctgccc atgcccaaga cctccagttc cttcctccca 31080 

cccctaaggc gttgtcagga agttgcctgg gcagccccgg cccgcatcat tcagaggctc 31140 

ctgcagcgca gcaaacagcc ttcttcccac attcggtgac agcacctgtt tgtttaccaa 31200 

ctgttacgtc tgttccccca gatatgggtg acccttcctg ccatgcccaa aacctcccac 31260 

atcgtcctcc agaggctaca ggggccctgt cctgttctgc agagaagcca catccccttt 31320

gttggcctga cacaggggat ggggacatgc aggcacagca ctggccatgc tgctcgctac 31380 

agacccagcc acagggccac attttttgag gggttcagag cccaggccag acagagcctc 31440 

aagattccct tacaagtctt tgaccactgt ccaagctcag gcccgtttcc ttggccgtgg 31500 

catcagcttc ccatccaccc ctgtattcca tgtttctccc accctgcttc tggacattcc 31560 

tacatttaaa gggtcactct ggaatgccac cccttggctc agacaccttc cacagctccc 31620

tgtgccagtg ccatgcagaa caaggtcaga ccccctagcc tggcctccaa ggccttggcc 31680 

tctggcctca cctacacttc tctccaccac cccaccccaa gcattcctga tctgcctgcg 31740

gccaggctgg ctccctcacc tccctgtgca ccgcagccct cagccccttc tgcctgtgca 31800

agaagcctca tctcacagac aacggtctca ttcccacaac gggctcaatg agaaatcagg 31860 

agaggccttc agaccatcac cccaccagac acctcagacg tcggaccagg agggtccagc 31920

aacccccaac acagactcag agggactaag aagccacatg aggagtgaac acaagatgtg 31980

gacaggagga ggttaagggc ctccagggag ctccatcagt ccgtgttctg ctgtcagcag 32040 

ggttaggctg ggctggccac aaacaccccc aaaaaacatc tgaagccttg gcttgaaaca 32100

gctgacattc ctcatgaaaa ctgcagaccc ctgggtcctc ctgcgcagat gggggagccc 32160 

agccaacccc acactcccac cttcaccaag aaagagaaag ccaaaacaaa ctcaactcag 32220

ccaatgacaa tcacagaact gaatcctgta gttagttcag ttggtttcat ttcagcaggg 32280

gaaagatttg cagcctctat gagggtagct gggaacacaa agggccagag catggcccag 32340 

gagaccccag cgcagtgggg tagatggttc cgagcacagg cctccctgcc aagacaagca 32400 

ctggctcaaa tcctggcccc tcccattccc aggagacatg ctccacagga tgggaggaca 32460 

cacagaggac ctgaggccag gaaaatgaca gcggcgcctc cgccgcccca cccgtgctgt 32520

catcatctta ggtctacagt tctttgtggc aacgagggac actgtgaaag tcaaacaaca 32580 

ggaaggcata ggccacaaat aaagacaaac gggacttcat gggaagctaa agattttgtg 32640

catcaaaaga cactatcgag agagtaaaaa ggcaacccac agaatgagag aaaatatttc 32700

caaatcatag atctactaag agattaatat ccatgaaata cagagaactc ctaaaactca 32760 

acaatgagaa aacaactaag ccaactcaaa aatgggcaaa caacttgaac agacatttct 32820

ccaaagatga catataaatg gccaataaac acatcaaaac aggcttaata tatccctaat 32880 

catcagggaa atgcaaatca aaactacaat aagataccat cttgcaccaa ttaggacggc 32940 

tactatcaaa aaaacaaaat agcaagtgtt ggtgaggatc tggagcaact ggaacccttg 33000 

tgcaccactg gcaaaaatgt gaaatggtgc agctactatg gaaaacagca tggcagttcc 33060 

ccaaaaactt aaacacagaa ttaccatatg acccagcaat ttcgctttgg gttatatacc 33120

caaaagaact gaaaacaggg acacaatcag atatgcatac accttggatc acagcagcat 33180 

ccttcccaac agctaaaaca tggaggcagc caggcatggt ggctcacgcc tgtaatccca 33240 

gcactttggg aggctgaggc gggtggatca cctgaggtca ggagttcgag accagcctgg 33300

ccaacatggt gaaaccccgt ctctactaaa atacaaaaat tagctgggcg tagtgacggg 33360

cacctgtaat cccagctact cacaagtctg aggcaggaga atcacttgaa ccctggaagt 33420

ggacgttgca gtgagccaag attgcgccac tgcattccag cctgggtgac acagcgagac 33480 

tctgtctcaa aaaacagcaa aacaaaaaca aaaaaacaaa caaacatgga agcaacccaa 33540

gcgtccctct actgagggat gaatagcggg gcaaaatctg ctccatccac acaatggagt 33600 

actattcagt ctcaaaaagg aaaaagattc tggtcaggca cggtggctca tgcctgtaat 33660 

cccagcactt ggggaggctg aggcgggtgg atcacctgaa gtcaggaatt caaggcccgc 33720 

ctggccaaga ctggcaccna gctacacana aagtatangg ccccggaaa 33769 

<210> 9 

<211> 72049 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 

<222> (8356), (8385), (38585)

<223> The identities of the nucleotides at the above locations are unknown.
<400> 9 

tataccttgc gcggaccttc ggctcctgtg gtgaagacaa tatgaagaaa atagaaatta 60 

cccataattt tgccacacag acttagttgt gtccatgtat cttgtgcacc ttttttctgt 120

ttacggatca aaatcgactt ttagggtcag gcgcggtggc tcacacctgt aatcccaaca 180 

ctttgggagg ctggagttgg ggttgggggg tggatcactg aagatcagga gtttgagacc 240 

agcctggcca acatggcgaa actccatctc tactaaaaat aaaagattag ccaggcgtgg 300 

tggtgggtgc ctctaatccc agctactccg gaggctgagg caggagaatc gcttgaaccc 360 

aggagacaga ggttgcagtg agccaggatc acgccactgc actccagcct ggcaacagag 420

cgagactctg tctcaaaaaa aaaaataaaa ataaaataaa taaatacata aattgacttt 480

taggagattg gttcaaacaa tgtgtgtaat gttgtgtctg agtgtttttc atttatcgtt 540

catgcaaatt ccgacatcat tcactcttct ccagagtgtg ctgttttcct gcctgtgtca 600 

tcacccgtca ccttgaatgc cctcgtttag gtaaaataag tacattttat tcaaaaatat 660 

ttgaggacat ttgggttgtc tccaggttct tggtcttgag ttttgctgtt cttgtggagc 720 

catggtggtg tctggttgca ggaacctcca tgcgttccag ctgctgcttc tgcctgtgtt 780 

cttagagagg aaatgctggg gtccgcggtt cccgggctgc tgaccaggaa gcctgcggtg 840 

ctttacggcc cttccagaag cgggagatgc ccccacttaa gtgtcagaca ggcctttcca 900 

cctcactggc agctctgagc ggctcccttc tatttgcaga tgactgagaa gttaccaatt 960 

tccacgttta ctgactgctg tttctcctgt taatttgtat ttatagtctt cgctaattta 1020 

ttgctagggt tttggtgttg tccctattga cttgtatgcc ttttaatttt ttaaacaaca 1080 

ttaatatact tcattttttt agagcagttt taagtttaca ggaaaattaa gggacaagta 1140 

cagagagttc cttccacctg ctgtcctcct ctcctcctcc ccaccttccc tccttcccct 1200 

attgtaactt tctttctgat attataaaag tcactcatgg ctgggcgtgg tggctcacgc 1260 

ctgtaatccc agcacgttgg gaggcagagg caggcagatc acctgaggtc aggagttcca 1320

gaccagcctg gccaacatgg tgaaaccccg tctctactaa aaacacaaaa agttagccag 1380

gcgtggtggc gggcacctgt aatcccagct actcaggagg ctgaggcagg agaatggcgt 1440 

gaacctggga ggcagaggtt acagtgagtc gagatcgcgc cactgcactc cagcctgggc 1500 

aataagagtg aagcttcgtc tcaaaaacaa agtcacacac gcttcttgta cgagggtcat 1560 

ttggccgagg ggccagatgg ctcaccatct agttgggaca ggccatgagc tcggaatgct 1620 

ttttacatat ttacatggtt gagaagaaaa tcaggagaat aatgttttgg gacatgggaa 1680 

aatgacatgg aatttgcatt ttagtgtcca taaatgaagt tttgtttgct cccagctgtg 1740 

ttgactgagg caggctggct tcctacagct gcggcagagc tgaggaggcg ggaaggagac 1800 

cgtgcaggcc gcagcaccga aaatatttgc tctctggccc ttcccagagt gcttgccgac 1860 

ctctgtccga cagctagaag gaaggatagg acccgtccga cgataaccac tgttgacatt 1920 

tgagcgcgtt tccttcccgg cttttgtgtg agagtggcag tctgtttgct tttgtggtcg 1980 

ggatctgctg cacgcacggc gggctgtttg catgaggctt cctggaggat agggctgggc 2040 

tcggagctgc acgcagtggg gcgtgtcctg catgcagtgg ggcctcagaa gagagctgtg 2100 

gtgggcgggg cagtgccaac gctggtgggt gccaggcctc cacgctcaga tcagccccgg 2160 

cgacaggttt gggccaccct ctctctggcc tctgtgcagt ggcccaggcc gtctgctctg 2220 

cctggcacac ttgcctctgt ccttccactg aagcgctcct cttaccctct gctcccggct 2280 

gggtacgttg aattgtgtcc ctcaaggaga tatgctaaag gtctaacccc aggaacctgt 2340 

gtatgtgatc taatttggaa acagggtctt ggctgatgta atcaagcgag gatgaggtca 2400 

ccctagagta gggggcctat atccacggtg ctggtgtcct catgagagca ggtgagcaga 2460 

cactgacact caggggtgaa ggctgcatgg agtcagaaca gggcttagtg cgatggcggc 2520

cacaagccaa ggaactccaa gtatttcctg caacaccaga agctggaaga tgccaggaag 2580

gatcctgccc tggagccttc ggagggagtc tgtccctgca gacgtcttga cttttgattg 2640 

cagggatgca tgtcttaggg tgtgtggggg ggtgcatttc tgatgttaga agccacctgg 2700 

ttggtggcga tgtgtcacgg gagccctctg caggttctgc gtgtccatgt ggtcggggac 2760 

agaggtgggc agggacggac ggtgtcgagc tggacatgtc catgacgtcg gccatccctt 2820

gggatggctt ttttgttttg aggataaggc tgcctgccag gaagctgtgc cctgcctggc 2880

ccttgcccca agcccctggc ctgtgcttgg cctcgcggaa gggatgtcgc ccttctctcc 2940

tgcatgcgtg cagggaggaa ggggagaggt cagcagcccg cctggaggag gctcgggcga 3000 

ggggaaggtt tcactttcag gcaatgttgt ggggctgttt aaacaacccc aaagaaaacc 3060

atttggccaa actgttagtt tccaaacatt ttacttcctt ggtgtttaaa taaattccta 3120 

ccaagactct gtagctggtc ccagggaagg agttggcctc tcttctttat agcccggcac 3180 

agtcagtccc ctgcacctgc ccctcccaac cccaggcctg cttccccgtg gccatggctg 3240 

ctgcccggac ctctctacac acagaacctc ctggaggcca gctgtgggca ccagccttgg 3300

cagggctgtg gcggagccca ggctgctggt actctctctg cagctgctcc ctgctggcct 3360 

ggctggacag cgtccccacc accactgggg tcacctctgt gctggtcaca gctcactcag 3420 

accttcaggc aaatgggttg gatcctgcct ctctcccagg tgtctcagtc tctgcaaaac 3480 

tcaaaaacct cagaggcctt gcagcctgag gggtgtcaga gacacctcct tcgaatcagt 3540 

aaacacctac agattcaccc cagcagtgaa aggactgctt cgccacagag gtttgattta 3600 

ctcctaagta attggaaggg atgccgagaa taggttcctc atggtgggac tagaggccct 3660

ctgctgacct agttaacaga gggctagggc tgggtgtgct cagcccctga aggttctagg 3720
cccatttggg acaccccgcc agaacctgcc acaacctgcc atgtggtgac agctacctaa 3780

atcccagagg ctcttgagct ggagagcaga cctctcaatc tcagcaggcc ccccacacag 3840 

accccataac cctagtctgc cttcacagta cagttcgtgg ctatgtgttc acggatggtg 3900 

ttgttcacct aaggtctctg ccctgtgacc ccaagggcgt cctgagggca gattccaagt 3960 

ctgtttcgtc cacccctcct tccctagcag cgggtccagg gcctggcctg aactagcttc 4020 

ccacagagat actggtggga tgatgaaggc agccaggcgg caagtgaaaa acgcacttcc 4080 

tgcatgtgct ggctcctggg attgaagtgt ttgaggaagc aaagtgaagt gagctttcct 4140 

cttgcggctg tgtgtccttg ggccgggagc ctaccctctc tgagcgttgg ggtccttgtc 4200 

agtagaatgg ggcatcctca tagctcaagg ggtggtgtgt gaaaattgtg ctattgtgtt 4260 

actttaatga tttttttttt ttcgagacaa agtctcaccc caacgcgcag gctggagtgc 4320 

agtggcgcga tctcagctca ttgcaacctc tgcctcctgg gttcaagtga ttctcctgcc 4380 

tcagcctccc aagtagctgg aattacagga gtgcgccacc aggcccggca tatttttcta 4440 

tttttagtag agagggggtt ttaccatgtt ggctaggctg gtcttgaact cctgacctca 4500 

ggtgatccac ctgcctcggc ctcccaaagt gctgggatta caagcatgag ccaccgcgcc 4560 

cggcctactt tagtgatttc ttaggaggac agagggaacg ggctggcaag acaggcttgg 4620 

aatgtgtttt gggatcaagt gccggtttct gtctggcact ggcgttctct gtggggccat 4680 

gatggacaca ctgctgaggt caagcgtgat tcgtcttgcg ctgtgcctgg cagtctcatt 4740 

ggaaagttct gtagacatcg tgtggatggg gctcttcccg gccaagccct tggggacctt 4800 

ccaggactgt gatctcccca cagtggctgt taagcaggga cctttcgtga agtggagtct 4860

ctggtcccct ccaagtcata gctagacagg gactcgggca tcgccaagcc tggctgatta 4920 

ttcactggat gaggagacag gcccagagag gggcaggaac ctgcccgagg tcacccagca 4980 

ggccccagag gtttcggtct cggattctcc ctgctcatcc ctggatgtag tgctgctgtg 5040 

gatgtggttc tgtgctgggg gctgtggaga gcagggggct tgtgccagga ccccagtgag 5100 

ggtggcgccc tcgccatgag gccgactgtt ggtatggggc ggccatccac tggggtgtgg 5160 

ggaggaacag ctttcctgag gaggaggtgg cgggaggaac agcttccctg aggaggaggt 5220 

ggcggtgctg tgtgacctgg gccttgaagg acaggtccat tgtcaacaga acattttggg 5280

agtggagcct agagggagaa aatttgttga aattcagatt cccctccccc taccaataca 5340 

caccaaatca gatgcccctg accagatcta aatttggctc tcagagattt ccattgtagc 5400

tgggcacttg gggaaccttc taagtgctgc ctctgcctct ccccagcctg cctgcctcag 5460 

tttccccagc cctgggcccg tgtcgctgtt gccatcacgt gggcgccctc tagtggagga 5520 

atcagattat gcactccggg gcttggagca ggagtcagga ggggctcctg tctttccttg 5580 

aaacgttgga tgccgggatc ctggaacagt ctctgcattc ctcctggcga gaaccagagc 5640 

ctgggcacag gggaccatct gttgtttgaa ggctgcagcc tggcagggca ctcaggagat 5700

ctggcagttg gctgcagggc caggtctagg ggccagggca tcagggaggc tctgggctgg 5760 

ttcagccccg ggcccctttg cagattgtga cctgggcccc tgtgcagggg catggccaca 5820

ggatgctggg aggggtctct gaccctgacc ttcttggctc tgtgcatcct tgagaccaga 5880 

aaggtctgga acaaatgagt agacgatgcc ctaacctggg gagggagcca catcctgatc 5940 

ccagcaacct cgggaaggat ctgtcaggat tatggggcac cctgggggcc ccaagtctgc 6000 

atgggtctcc acttgcaatt tctgtaggaa gctctgataa atccaaactg ggggtcctag 6060 

gacacagtca gaaatgctga taccgttgtg tgtggagcct cgggccctgg gggtcaggag 6120 

catgtggagg gtgggccacg ggggttcaga agagaatcct gtaacccccc accccccaaa 6180 

ctgaagccca cttgagggcc atggctgaaa ggttgggggg tctccgtgcg tcctgtggag 6240
tgggtggtga ggagtccttg ggtttgcacg cctctgggcc tgagcggcgg gaccccgtcc 6300

acagcggatc cctgggccct gttgctcaga tgctctcaga gtgttgctgt ggccacggag 6360

ggagcctgag ttaagcttct cttgtgccgg ttgtacgctg tcaggtcaca ctggtgagtt 6420 

aggcagggca cagatgccca gagcagaggg aactttcctt ggggattcaa cacgtgcaag 6480

tcttaggggc tggcaaatcc tgccctcagc tagagagggg gcttttattt gagaccagaa 6540 

tcacctgagc atcctcctgt ccccagctgt gtccagcctg tctgcaggga catcctgaga 6600 

ggaccaggct ctcccctcat ccacctgcct aagtgccact ctgaaccctg tccacctgtg 6660 
ccgtggaggg gcgtgacctc aagctgctca gccagcagca ggcttggccc tggggggcag 6720

cagagaccca ggtggctgtg gggtgggtgc ttcgtggcgt ggttctgaaa cttcgttgga 6780

agtgtgtgga cagtgccttg cctgttctct gtgggaccct atttagaaac gaggtctgag 6840 

ttactggggg tcatcactgt gttctgatgg cccagctgtg tggaggccgc ggtgcagccc 6900 

catccaagga gccagggccc tgggtctagc cgtgaccaga atgcatgccc cggaggtgtt 6960 
tctcatctcg cacctgtgtt gcctggtgtg tcaagtggtc gtgaaactct gtgttagctc 7020

ttggtgttcc tgaaagtgcc cccgggtctc aggcctcaga accagggttt cccttcatct 7080

cggtggcctg ggagcatctg ggcagttgag caaagagggc gattcacttg aaggatgtgt 7140 

ctggccctgc ctaggagccc cccggcacgg tgctggggcc tgaagctgcc ctcgggtggt 7200 

ggagaggagg gagcgatgaa gtggcgtcga gctgggcagg aagggtgagc ccctgcaagg 7260

tgggcatgct ggggacgctg agcagcatgg ccagcagctg ggtctgcagc ctggtacccg 7320 

gcgggacttg tggttggggc tggtttgtgg ccaggagagg ggctggcagg agacaagggg 7380

gactgtgagg cagctcccac ccagcagctg aagcccaatg gcctggctgt gtggctctca 7440 

gctgcgtgca taacctctca gtgcttcagt tctctcattt gtaaaatgag gaaacaaaca 7500 

gtgccagcct cccagaggtg tcatgaggat gaacgagtga ccatgtagca tgggctgggt 7560 

gcgtgtcacc taacatcacc agcctttgca aggagagccc tgggggcctg gctgagtatt 7620 

tcccttgccc ggcccacccc aggcctagac ttgtgcctgc tgcaggccct tgacccctga 7680 

ccccattgca cctgtctcca caggagccga ggaggtgctg ctgctggccc ggcggacgga 7740 

cctacggagg atctcgctgg acacgccgga cttcaccgac atcgtgctgc aggtggacga 7800

catccggcac gccattgcca tcgactacga cccgctagag ggctatgtct actggacaga 7860 

tgacgaggtg cgggccatcc gcagggcgta cctggacggg tctggggcgc agacgctggt 7920

caacaccgag atcaacgacc ccgatggcat cgcggtcgac tgggtggccc gaaacctcta 7980 

ctggaccgac acgggcacgg accgcatcga ggtgacgcgc ctcaacggca cctcccgcaa 8040

gatcctggtg tcggaggacc tggacgagcc ccgagccatc gcactgcacc ccgtgatggg 8100 

gtaagacggg cgggggctgg ggcctggagc cagggccagg ccaagcacag gcgagaggga 8160 

gattgacctg gacctgtcat tctgggacac tgtcttgcat cagaacccgg aggagggctt 8220 

gttaaaacac cggcagctgg gccccacccc cagagcggtg attcaggagc tccagggcgg 8280

ggctgaagac ttgggtttct aacaagcacc ccagtggtcc ggtgctgctg ctgggtccat 8340 

gcgtagaaag ccctgnaaac tggagggagc cctttgtccc cctgncttca gtttcctcat 8400 

ctgtagaatg gaacggtcca tctgggtgat ttccaggatg acagtagtga cagtaagggc 8460 

agcctctgtg acactgacca cagtacaggc caggcctctt tttttctttt tttttttgag 8520 

atggagtctc actctgtcgc ccaggctgga gtgcagtggt gtgatctcag ctcactacaa 8580

cctctgcctc ctgggctcaa gtgattctcc tgcctcagcc tcctgagtag ctgggattac 8640 

aggtgcctgc cactgtgctt ggctaatgtt tgtatttttg gtagagatgg ggtttcaccg 8700 

tcttggccag gctggtcgca aactcctgac ctcaggtgat ccacctgcct cagcctccca 8760 

aagtgctggg attacaggca tgagccacca cgcccggtca ggccaggcct cttttgaaca 8820 

ctttgcacac catgggtctt ttcatccagg ggggtaggta cagttgtaca gttgaggaca 8880

ctgaagccca gagaggctca gggacttgcc cagggtcaca cagcaggatg tggcaggtgt 8940 

ggggctgggc ctggcagcgt ggctccagct ttccagcata gaaatctgtg aaagcagata 9000 

gtttgtcggt cggtagggga gactttctga gacccgcccc agcggctcag agggtagtag 9060 

ccaggggcct tcctgggggc tcataaccca gaacactgaa tgggaaaacc ctgatggagg 9120

aggcgcagtg gagctgtggg tgccgatggg aagtcccaga ggagctggga ggtcagtagc 9180 

ggtgctgccc tctgtggagc acttagtggg caccaggtgt gtttccaggt tcatggccct 9240

gggacctgaa gctcagaagg tgaagtaact tgcccagggc acccgtcggg cagcggcggg 9300 

cagaggattt gtgggctgtg gagcctgtgc tcgtggccca gccctggggg ttgtgagtgt 9360 

gctggccggg gagcttttcc tgcaagtgga ctggtgtcta ggagccagca tgtcaggcag 9420 

caggcagcgg gagtgcagca ggcagcggga gcacagcagg cagagggcgg ggctcgagca 9480 

gccatccgtg gaccctgggg cacggaggca tgtgggagag ggctgctcca tggcagtggc 9540 

tgaagggctg ggttgtgccc cgaggagggt ggatgagggt aagaagtggg gtccccaggg 9600 

gctttagcaa gaggaggccc aggaactggt tgccagctac agtgaaggga acacggccct 9660 

gaggtcagga gcttggtcaa gtcactgtct acatgggcct cggtgtcctc atctgtgaaa 9720 

aaggaaggga tggggaagct gactccaagg cccctcctag ccctggtttc atgagtctga 9780 

ggatcccagg gacatgggct tggcagtctg acctgtgagg tcgtggggtc cagggagggg 9840

caccgagctg gaagcgggag gcagaggggc tggccggctg ggtcagacac agctgaagca 9900 

gaggctgtga cttggggcct cagaaccttc acccctgagc tgccacccca ggatctgggt 9960 

tccctccttg gggggcccca gggaacaagt cacctgtcct ttgcataggg gagcccttca 10020 

gctatgtgca gaaggttctg ctctgcccct tcctccctct aggtgctcag ctcctccagc 10080 

ccactagtca gatgtgaggc tgccccagac cctgggcagg gtcatttctg tccactgacc 10140 

tttgggatgg gagatgagct cttggcccct gagagtccaa gggctggtgt ggtgaaaccc 10200
gcacagggtg gaagtgggca tccctgtccc aggggagccc ccagggactc tggtcactgg 10260

gcttgccgct ggcatgctca gtcctccagc acttactgac accagcatct actgacacca 10320 

acatttacaa acaccgacat tgaccgacac cgacatttac cgacactgac atttaccaac 10380 

actgtttacc aacactgaca tctactgaca ctggcatcta ccaacactga catttaccga 10440

cactgacatt taccaacact atttaccaac actgacatct actgacattg gcatctacca 10500 

acaccaacat ttaccgacac caacatttac caacactgaa atttaccgac accgacattt 10560 

accgacaccg tttaccaaca ccgacgttta ccgacaccga catttaccga cactgatatt 10620

taccaacact gacatctact gacgctggca tctactgaca ccgatgccag catctaccaa 10680 

caccgacatt taccaacact gacatttacc aacactgaca tttaccgaca ttgacattta 10740 

ctgacactga catctactga cactggcatc tactgacact gacgtttacc gacactagca 10800 

tctactgaca ctgacattta ccaacaccag catctaccaa caccgacatt taccaacact 10860 

gacatttact gacactgata tctactgaca ctggcatcta ctgacaccaa catttaccaa 10920 

caccagcatc taccaacacc gacatttacc aacaccagca tttaccaaca ccgatgttta 10980 

ccaacgccga cgtttaccga cgccagcatc taccaacact gacatttacc gacaccgaca 11040 

tttaccgaca ctgacattta ctgacactga catctactga tactggcatc taccgacact 11100 

gatatttacc aacgccagca tctactgaca ctgatgttta ccaacaccga catttacgag 11160 

caccgacatt tactgacacc aatatttact gacatcaaca tttagccatg tgatgggggc 11220 

cggcttgggg gcaggccttg ctcttggcac tggggatgct gcagagacca gacagactca 11280 

tggggtcatg gacttctgct tcttctccag cctcatgtac tggacagact ggggagagaa 11340

ccctaaaatc gagtgtgcca acttggatgg gcaggagcgg cgtgtgctgg tcaatgcctc 11400 

cctcgggtgg cccaacggcc tggccctgga cctgcaggag gggaagctct actggggaga 11460 

cgccaagaca gacaagatcg aggtgaggct cctgtggaca tgtttgatcc aggaggccag 11520

gcccagccac cccctgcagc cagatgtacg tattggcgag gcaccgatgg gtgcctgtgc 11580 

tctgctattt ggccacatgg aatgcttgag aaaatagtta caatactttc tgacaaaaac 11640

gccttgagag ggtagcgcta tacaacgtcc tgtggttacg taagatgtta tcattcggcc 11700 

aggtgcctgt agacacagct acttggagac tgaggtggga ggatcgctgg agtccaagag 11760 

tttgaggcca gcccgggcaa aggggacaca ggaatcctct gcactgcttt tgccacttac 11820 

tgtgagattt aaattatttc acaatacaaa attaagacaa aaagttaatc acatatccac 11880 

tgccctgctt aagacagaaa acatgggtgt tgttgaagcc agaggcagct gctggcctga 11940 

gtttggtgat tggttcctaa gcagttgaag gcagttttgt ttttccatag atgtctgttc 12000 

tccctttgct gggtgcagcc tcgccctgct gctgtggtcg ggtttcagtg gcctcgtccc 12060 

gtggacgcag cctcgccctg ccgctgtggt cgggtttcag tggcctcgtc ccgtggacgc 12120

agcctcgccc tgccgctgtg gtcgggtttc agtggcctcg tcccgtggac gcagcctcgc 12180 

cctgccgctg tggtcgggtt tcagtggcct cgtcccgtgg acgcagcctc gccctgccgc 12240

tgtggtcggg tttcagtggc ctcgtcctgt ggacgcagcc tcgccctgcc gctgtggtcg 12300 

ggtttcagtg gcctcgtccc atgggcgtgc tttggcagct ttttgctcac ctgtggagcc 12360 

tctcttgagc ttttttgttt gttgtttgtt tttgtttgat tttgtttgat tgtttgtttt 12420 

tgttgtcgtt gttgttgccc aggctggagt gcagtggcgc gatctcagct cactgaaacc 12480 

tctgcctcct tgggttcatg ccattctcct gcctcagcct cccacatagc tgggattaca 12540 

agtgcccgcc accacgcctg gctaaatttt gtatttttag tagacagggg gtttcaccat 12600 

gttggtcagg ctggtctgga actcctggtc tcacatgatc cacctgcctc ggcctcccaa 12660 

agtgttggga ttacaggcgt gagccaccgc gcccagccct ctgttgagca tattttgagg 12720 

ttctcttggt gccagtgata tgtacatgtg tccccatcgc accatcgtca cccattgagg 12780 

tgacattggt gcctctcctc ggggtggatg cctccctctg tttccagcaa cttctgaagg 12840

attttcctga gctgcatcag tccttgttga cgtcaccatc ggggtcacct ttgctctcct 12900 

cagggctccc aggggaggcc cgaatcaggc agcttgcagg gcagggcagg atggagaaca 12960

cgagtgtgtg tctgtgttgc aggatttcag accctgcttc tgagcgggag gagtttcagc 13020

accttcaggg tggggaaccc agggatgggg gaggctgagt ggacgccctt cccacgaaaa 13080 

ccctaggagc tgcaggtgtg gccatttcct gctggagctc cttgtaaatg ttttgttttt 13140 

ggcaaggccc atgtttgcgg gccgctgagg atgatttgcc ttcacgcatc cccgctaccc 13200 

gtgggagcag gtcagggact cgcgtgtctg tggcacacca ggcctgtgac aggcgttgtt 13260 

ccatgtactg tctcagcagt ggttttcttg agacagggtc tcgctcgctc acccaggcga 13320

gagtgcagtg gcgcaatcac ggctcgctgt agcctcaatc tccctgggct caggtgatcc 13380 

tcctgcctca ccctctgagt agctgggact acagacacat accaccacac ccagctagtt 13440 
tttgtgtatt ttttgtgggg ggagatgggg tttcgctgtg gtgcccaagc tgatctcaaa 13500

ctcctgaggc acaagcgatc cacctgcctc ggcctcccaa agtgctggga tgacaggcat 13560 

cagccgtcac acgcagctca atgattttat tgtggtaaaa taaacatagc acaaaattga 13620

tgattttaac cattttaaag tgaacagttc aggctgggcg tggtggctta tgcttgtaat 13680 

cccagtactt tgagaggctg aggtgggcag atcacctgag gtcaggagtt tgagaccagc 13740

ctggccaaca tgatgaaatc cagtctctac taaaaataca aaaattagcc gggcatggtg 13800

gcaggtgcct gtaatcccag ctactcggga ggctgaggca ggagaatcgc ttgagcccgg 13860

gaggtggagg ttgcagtgat ctgagatcat gccactgcac tccaatctgt gtgacagagc 13920

aagactctgt cttgaaaaat aaataaataa aaaaaatttt aaaaagtgaa caattcaggg 13980 

catttagtat gaggacaatg tggtgcaggt atctctgcta ctatctactt ctagaacact 14040

ttcttctgcc ctgaaggaaa ccccatgccc accggcactc acgcccattc tcccctctct 14100 

cccagcctct gtcaaccact aatctacttt ctgtctctgg gggttcactt cttctggacg 14160 

ttttgtgtga ctggaatcct gcaatatgtg gtccctgcgt gtggcttctt tccatagcat 14220

tgtgttttcc agattcaccc acacattgtc gcacgttatc agaatctcat tcctgactgg 14280 

gtgcagtggg ttaggcctgt aatcctaaca ttctgggagg ccaaggcggg acgatcactt 14340 

gaggcaggag tttgagacca gcctggccag cctagcaaga ccccagctac caaaaaattt 14400

taaaagttaa ctgaacgtgg tggtggtggg cacttgtggt tcccagctac ctgggaggct 14460 

gaggttggag gatcgcttaa gcccaggagg tcaaggctgc agtgagctat gatcgcacca 14520 

ctgcactcca gcctggacaa cagagcaaga ccctgtctga aaaaaaaaac aaaaaaaaaa 14580 

gttcctttct ttttgtggct ggatgacatc ccattgtatg gccacagcac attttgtttg 14640 

tctgtttatc gggtggtggg cagtggtttc caccttttgt ctcctgtgaa taatgctgct 14700 

gtgaacattt gaattcaagt ttttgtttga acacctgttg tgaattattt ggatatatgt 14760 

gtaggggtag gattgctgag tcctatggta atgttaggtt tgacttactg aggaaccatt 14820 

aaactgtttt caacagtggc tgcgccgttc tgcatcccca ccggcagtgt gtgagggttc 14880

tgactttacc tcctcacaaa cgcttctttt ccatttaaaa aaatattcag ccaggtgctc 14940 

tggctcacgc ctgtaatccc agcactttgg gaggccgtgg cgggcggatc acctgaggtc 15000 

aggagttcga gacgagcctg gccaacatgg tgtaacccca tctctaccaa aaatataaaa 15060 

attagccggg tgtggcagcg ggcgcctgta atcccagcta cttgggaggc tgaggcagga 15120

gaatcacttg aacccgggag gcagaggttg cagtgagcca agatcgcgcc actacactcc 15180 

agcctgggtg acaagagtga aactccatct aaaataaaac aaaaataaaa ataaataaaa 15240 

atttattaaa acattcatca cagccagcct agtgggtgtc ccatgtggct ttgcctcgca 15300

tttccctgat aactaggatg ctgagcgtct tgtcccaggc ttgccacacc tcagcacttt 15360 

gagatacgtc gcacagtccc catttgcgaa cgagaaatga ggtttaggga acagcagctg 15420

tgtcatgtca cacagcgagc agggggtctc tgagccgtct gaccccacag ccgaccaagc 15480 

tccaatcctt accgcctcct agtgttgtgg atgtagccca gggtgctccc acatttttca 15540 

gatgagaaca ccgaagctca aaacaggagc gttttgtcca cattggatac acgatgtctg 15600 

tggtttggtc ctgaagtcac tttatatctc agtggtccag actggagtag gacagggggt 15660 

tctggggaat ggggaaggtg tctcaggtga aaggaaggaa ttccagattc tccatactgt 15720 

ccttgggaag ttagaagact cagagggtct ggcaaagtca gacaaagcaa gagaaatgca 15780 

gtcaggagga agcggagctg tccaggaaca ggggggtcgc aggagctcac ccccaggaac 15840 

tacacttgct ggggccttcg tgtcacaatg acgtgagcac tgcgtgttga ttacccactt 15900 

tttttttttt tttgaggtgg agtctcgctc tcttgcccag tctggagtgc agtggcacga 15960 

tctcggctca ctgcaagctc tgcctcccgg gttcatgcca ttctcctgcc tcagcctccc 16020 

gcgtagctgg gactacaggc gcctgccacc gcgcccggct aatttttgta tttttagtag 16080 

agatgggatt tcactacatt agccaggatg gtctcgatct cctgacctca tgatccgccc 16140 

gtctcggcct cccaaagtgc tgggattaca ggcgtgagcc accgcgcccg gcccgatttc 16200

ccactttaag aatctgtctg tacatcctca aagccctata cacagtgctg ggttgctata 16260 

gggaatatga ggcttacagg ccatggtgct ggacacacag aagggacgga ggtcaggagg 16320 

tagaagggcg gagagaggga acaggcggag gtcacatcct tggctttcaa aatgggccag 16380

ggagagacac cctctgagca tggtaggaca ggaaagcaag attggaacac attgagagca 16440 

accgaggtgg ctgggcgtgg tggcttacgc ctgtaatccc aacactttgg aaagctgagg 16500

tgggtggatt gcttgaggcc aggagttcaa gaccagcctg gccaacatgg tgagaccccg 16560 

tctctactaa atatacaaaa attagccagg cgtgatggtg catacctgta atcccagctg 16620 

cttgggaggc tgaggcagga gaattgctta aacctgggag gcggaggttg cagtgagccg 16680 
agatcccgcc actgcactcc agcctgggcc acagagtgag actccatctc aaaaaaaaaa 16740

aaaaaaaaga taaaaagacc aaccgaggaa ttgaagtggg ggggcgtcac agtagcagaa 16800 

gggggatcgt ggagcaggcc accctgtggt catgcactgg aagctcatta cctgacgatt 16860 

tggagctcat cactgggggc ctaaggagaa tagatactga aggatgagga gtgatggcgc 16920

ggggcacggg tgtctttggt ggccagaact tggggactgc tggggtgcct cactgcaggc 16980 

cttctcagcg ccctttatat gcttacacag gctgtttcta agagggggat acattgcata 17040 

agcgttttca gactacctca tcatgggtcc ctttctttac cctctgtggc cctggtggcg 17100 

cactctctgg gaaggtgcag gtggatgccc agacccgccc tgccatccac ctgcacgtcc 17160 

agagctgact tagcctcgag attgctgctg gcacctcctg ccccgggaca cctcggatgt 17220

gcccgtggag atgctggctc tgtgttttct gctggagttt ggtgcgtctt ttcctcctgc 17280 

aagtggccac cgctcttggg tatgtcctca ggcttctgcg agtcatggct gcttctcagg 17340 

tccttgccca gcgccaggag caaaccctcc tggcactttg ttcaggggtg gatgcgccag 17400 

tgttcctgct gtggaccgcc atctcacatg agggtcttgg gcctgcaggc tcgttcagga 17460 

aacacccgct gagtatgcag tgtgtgccag ctgtgtccca ggcaatggcg gggacagtgg 17520

ctgctgctgg ggttgtggtg gcttctgggg actctgggga cagctgaggt gcaaggagcc 17580 

acggctcctt gaggatgcag ttggactcca ggtggaaggg atggttgggg gaggtataaa 17640 

tggggtcagg gaggagacac atttggaaca atgggaacat ttttaagatg ctatgtcggg 17700 

aggcaacaag gtggccaacc caggtgctga ggagcccaca ccagccctgg acgtgttttg 17760 

ccgctcacct ttgctgggga gtggtgggag agaggattcc gttccacgtg gtggtgtgcg 17820

cagctgggct gtgtggagct gggcgctagg aggaaggtgc tttctgcggg gctagccggg 17880 

ctctgccttt gaacacaatc aggctccagg ttttcagcat ccagtgcatg agaggacttc 17940

acgggcagct gtggctgatc ccttgatgaa ttgggagaag aacaaaggtc tatgaaatga 18000 

ggtttcatgt agatggcatt agagacgccc acaacagatt tacagagtgg agcggagacg 18060 

gcggatgggt ctgggaggcc cctcctgctg gccttgactg tgacagctgt cctgggaatc 18120

agcttccagg ccgccccagc agcctgactg acacacacag gggttttagc cccatcctgc 18180 

gaccagctgt tgccatcatc agtgacagct gggagtggcg gtggttccag ccctgggcac 18240 

cctccccacc tgctggggcc cacccagggc agtcctgaca cctacaggtt gcttggagcc 18300 

gcatccgagt cctgccccac cacgtgtgaa gcccgagtgg tcgtgggctg aggtcccctg 18360 

attgcatccc cacttccctt ctgcttcaca tagctgcctc ttctcaccgt ttttccagcc 18420

tcctgggcta ggaattccag tgttgtgctg gctttgcccc aggacacctc cttagccctc 18480

ttcctgagtc tagagccccg ggggttggaa gtcctggccc ctgggacacc tgcagccaca 18540 

ctcagcttct cctgtgagcc tccagcatgt cccctcagga ccaagccctc acgttcttgc 18600 

ctccccgccc acctgggctc agccagggga aggcctggct gggagcgtct cccctctgcc 18660 

ctgcccttct cccctcctac cctgcccttc tctcctctgc cccgccatgg cttttatatc 18720 

ctgtgccaca agacatggct gtgtgtgaaa gtggcagggt ctggcatctc tgtgggtctc 18780 

tgaggcccac gctccagtgc cactcttccc acccgctggc cgtgccctca tgctggaggg 18840 

acagcccagc cctctcccga accccagccc catgtgccca gctgcccccg gccctctccc 18900 

ctggaagccg gggtcactcc agccgtatgc catggtgggg acatcctgct tccttggcct 18960 

tccagggaag gtcctctttc caaatggcga cacctggtcc ctgcctggag gctggaagct 19020

gtggcccttg tatgcccctc cagggtctgt gcgctcggtt ggcccgagtt cccatcaccg 19080

tcatcatcac catcatcatt gtcatttcgc ttgtctgtga gccggcctgg tctcccagag 19140

cagagaccct ctgaggtcca gcctgagttg gggtctccgt gctgacccct gacggggact 19200 

caggacgtac caggtctggg tcaggagtga cccccaaacc tcgtgccctt tgacaggcac 19260 

ccctgacttt tgctaagtgg gtggaggtga catcacttac agcgggagtg atgggacagg 19320 

gtctgttggc tgcactgtgc tcccagggat ctggggagag gctatatccc tgggctttgg 19380 

cactgcagag ctgtgtgtgt ttgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 19440 

gtgtgtgttt gcgtgcgcgc acatgtgtat aagatctttt tttattacat gaagcaagat 19500 

aactgttgct gtttcctttt gggttttgtg ttcaacagag tggggtactt cttccctcag 19560 

acaacagaac tctccccttt aaacacgtgc tgtcagaggg tgggtcttgg gctcatgtct 19620

gtttgcacag ccgagtcaga ggaaacacag ggttcttcat aaaaacactg cacagcaggc 19680 

gactgtccag agtcagcctg caggacggca gcagccctgc ccctcagagc acagctaggg 19740

tgggctgctt tgggatctcc cgtcattccc tcccagctgg cagccggcgg ccggcccatt 19800 

ccttggtgtg ctggtcaggg gggcgtgcgc ctgctctgct caccctggga atgggacaga 19860

agctggcagc tcggagagga cagggctgga cccttgggtg gcctctggct ggaccatctc 19920
attgtcctca gacacagcct ctcgggtcta gtttcatttc ctgaaaaaca agtgcacaga 19980

actagagcag gagtcgagag ctacggcccc cgggccagat ccagccctgc cacctgtttt 20040 

cacaccatgc tcaagctgag tgggttttac attttttaat tacttgaaaa aaaaaaagcc 20100

aaaggaggtt tcatgaccca tgaaaattat atggaattca aaaaaaaaaa attatatgga 20160 

attcaaattt cagtgtccat aaataatttc ttgagacagg gtctcgctct gtcacccagg 20220 

ctggagtgca gtgctatggc atggctcgct gtacccttga cctcccaggc tcaagcgatc 20280

ctcctgtctc agcctcctga gtagctggga ctacgggtgt gtgccaccaa gcccggctaa 20340 

ttttttttta attttagtaa agacagggtc tttctatgtt gcccaggctt ttctggaact 20400 

ccatcttggc ctcccaaagt gctgggatta caggctcgag ccacggagcc cagcctgttt 20460 

ttgttttttc actgataaag ttttgccggg tgtggtagtg tgtgcctcta gcgatttggg 20520 

aggctgaggt gggaggatcg cttaagccca ggagtttgag gctgggctca agtgatcagg 20580 

aggtgaacta tgatcatgtc attgcattcc agcctgggtg acagagcaag aacctatctc 20640 

ttaaaaatat atatttaaaa agtattgggt gtggtggctc acgcctgtgg tcccagctac 20700 

ttaggcatct gaggtgggag gatggcttga gcccaggagt ttgaggttgc agcgagccaa 20760 

gatcgtgtca ctacactcta gcctgggtga cagagcccag accctgcctc tttaaaaaaa 20820 

aaaaccaaaa aacatgtatt ggaacacagc catgcctgtt cagtcacgtg ctctccatgc 20880 

tgctttctgc tccagagacc cttatggcct gaaagctgaa aatattttct atcctttaca 20940 

aaaaagtttg ctgacctctg tcctggaaaa ttcatctccc aagttctctt ccggcactgg 21000 

cgttcctggg tgtcctaaat ttggcccctg ttatttctga actctgtttt ggctctgttc 21060 

cctcccagga gccaggacag gcacgttctc tgcatcttgt cccctgacgc ccagaggctt 21120 

ggctcggctc aggcattctt ggaaatatct ggctccagga aaggcagagg cctcctgagt 21180 

cggcccagag ggaacctgcc ccaggtctgg gggaggcctg acccagcaga gtggcttttg 21240 

ccgatgggtt gggccggtca agatgtgctg aaagttgtcc tcagaaggcc actttgggat 21300

tccttcctcc agtattagag caactgagag ctgctcattg caagcctgat gttttcccag 21360

ttggccgggt ccaccgggtg ccctgggatt ctgggatctg ggtggaaagt agggggcttg 21420

ggggagtgtc ctgggttctg gaatccaggt ggcaagtggt gaggttcagg gagtggcttc 21480 

tgagccacca taggggtctc tgtgggaggc tctgcccatc caggagattc cgcaggccct 21540 

gccggcccag agccagcgtc ttgcgcttgc cgaggctaca gccagcccca gccgggtgga 21600 

acagcccgtc gcctcctctc actttgtttt ggggccacct gggagtgtgg agcaagggta 21660 

gagagggagg aagtggctgc cggccgctgc ccagcaccct tgtttgcctt gggccctctg 21720 

tgggctcctt tttattgctc ttcaatgaag ccagggaaat ggacttcctt gcctcacttc 21780

agttcaacat gtctggaagt ttggtattaa aattaagaaa gtgtggaaat agagcaagaa 21840

gagaaaaatc tctccaagag ataatagtga cctctgagct gggcgcggtg gctcacgcct 21900 

gtaaatccca gtactttggg aggctgaggc gggcagatca cctgaggtcg ggagtttgtg 21960 

accggcctga ccaagatgga gaaaccccgt ctctactaaa aataaataaa taaataaata 22020

aataaataca aaattagcca ggcatggtgg cgcctgccta taatcccagc taaggcagga 22080 

gaatcgcttg aacctgggag gcaaaggttg cagtgagcca agatcacgcc attgcactct 22140 

agtctgggca acaagagtga aactccgtct caaaaaaaat aaataaataa aaaataaaaa 22200

tagtgacctc tggccaggtg tggcagctca tacccgtaat cccagcactt tggaaggaag 22260

gccgagatgg gcagattgct ttagcacagg agtttgagac cagcctggcc aacatggtgg 22320 

aaccccatct ctacaaaaat agaataaaat ttaagaggta atagtgacct tttggtagat 22380

cgaaacctgg attgctttct ttttctaaat gctgattctt ttctttgtgg tgtttgtgtt 22440 

ctgtgccgat gtccctcccc cagccctgtt attgtgagtg gaagaagggg aaagggttcg 22500 

cccgctactg tgagcccctc ctctcacgct gggtgtcctt ggagaagcct gcacttcttc 22560 

attgtacgcc agggctgggt ccctccctgg agtggttctg tgctgctggg atggggccaa 22620

cccctcagat gttttctgag tgtcacacac aggtgtgtgc attcatggcc tttgcgtgtc 22680 

ttcctgttgt ggaggcaaaa atgtgaagaa ccctagatga ttttgggacc agggctccat 22740 

cacctgctgt tcattgcaca ccggagcatc caggcatggg tggagagctc agacttccag 22800 

gcacggtcgc aggggctggt ctaaccatgt tcccgcccgc ctgctcgtca gaaccgcctg 22860

ttgggagctg ttatcatgat accatacctg ggccctgggc tatccgattc tgacttaatt 22920

gctccaggtt ggggccaggc cgttgtttgc tgttttgttg tttcttctgt gacgttagcc 22980 

actgggctaa tctgagcccc tcagttacag gtggagaaac tgagacccat gggggtgcaa 23040 

ggacttgccg aggacccaga gccccttggg ggcagagctg aggcggggcc tggctttggg 23100 

tcccagagct tccagtcccc ttcccgctct cctaacagct tttttttttg agacaagatc 23160 
tcaccctgtc acccaggctg gagtgcaatg gcatgatctc ggctcactgc aatcttcgct 23220

agctgcgttc cagcgattct cctgcctcag cctcccgagc agctgggatt acaggtgtgt 23280

gccgccatgc ccagctcgtt tttttttgta cttttagtag agatagggtt tcaccatgtt 23340

ggccaggctg atctcgaact cctgacctca aatgatccgc ctgcctcggc ctcccaaagt 23400 

gctaggatta caggctggga tcacactgtg cctggcccta gcagctttgt cctgtgccat 23460 

ccaacaacag atgaccgaag tctttgtttc ttaacatgca ttccatctgc cttacagttt 23520

tgccacctgc aaaacagagg acttgtcgct tttctggtaa gctggaaatg taatctggta 23580 

gcaggaggcc tgtggaagct tgcctttaat ggccttgtgt ctctttcatc ctgtcctgag 23640

agccggagaa cttggatgtt gcacctaact caaccttcct gttaacatac agttctgcag 23700

gctcatggat catcagaacc acgtcctatc tcacgcggct gtatgcttcc gttggttcag 23760

gtgtttttac cttgacagta ttttctcctc ggtggctttt gcggtggttg cttttaatca 23820

gcattgactc ttcaagaaaa atatttagct gctacatctc agaggagaca gggtggaaag 23880 

catctgagac ctgcaggctc agacttagaa ccagaagtgc cctcagagtt catccggccc 23940 

tgacccagcg ggaaatgagt tcacagagaa gcgggagaac tttgccccag gccctgccgt 24000 

tgctcataac tgccccaggt ccttacattt gctccaggtc ctgccccagg ccctgcagtt 24060 

gctcataact gccccaggtc cttatatttg ctccaggtcc tgccccaggt cctgcagttg 24120

ctctgtgtgg tgggtgtgat ctggagccct ccgcccattg ctgcacctgg ggcaggcatt 24180

gctaattgat cccaggactc cttcctgcgg agcacgccct ggttctccag gcagccgctg 24240 

cctgtcagcc tgcagtggtt cgggagagga cacctgcttg cctggtctgt tccaaatctt 24300

gcttctcatc ccagcacagg tagggggtgc tatgggaaag ggatcctcag ttggccctgt 24360 

cactgctcta tcagctgggg acgtggcatc ctagtgaaaa catcatggcc gggcgcggtg 24420

gctcacgcct ggaatcccag cactttggga ggctgaggag ggtggatcac ttgaggtcag 24480 

aagttcgaga ccagcctggt caacatggtg aaacccatct ctactaaaaa tacaaaaatt 24540 

cgccaggtgt ggtggcgggt acctgtaatc cgagctactc gggaggctga ggcaggagaa 24600 

tcgcttgaac ctgggaggtg gagcttgcag tgagccgaga tcttgccact gcactccagc 24660 

ctgggcaaca gagtgagacg ctgtctcaaa atctcaaaca aacaaacaaa caaaaaacaa 24720 

acaaacaaag cgtcatttat ccagcacccc tggggaacca tgctacctgg tgttttatgg 24780 

tacctggcaa ggtgcaggtg aagttgctgc tcttgggcat tgaacccgtc ttgtttgggg 24840 

cagctcaggc cccaggcagg gtccgggttg gctctcgttg gtgtggccct ggcccatcca 24900

gacctatatt tctgccgtcc tgcaggtgat caatgttgat gggacgaaga ggcggaccct 24960 

cctggaggac aagctcccgc acattttcgg gttcacgctg ctgggggact tcatctactg 25020 

gactgactgg cagcgccgca gcatcgagcg ggtgcacaag gtcaaggcca gccgggacgt 25080 

catcattgac cagctgcccg acctgatggg gctcaaagct gtgaatgtgg ccaaggtcgt 25140 

cggtgagtcc ggggggtccc aagccatggc tcagccatgc agacttgcat gaggaggaag 25200 

tgacgggtcc atgcctgggc ataagtgttg agctcaggtg ccccgacctg gggaagggca 25260 

ggacaggaaa ggtgacagta tctggccaag gacagatggg aagggaccaa gggagctgat 25320 

tagggagtgg ttatggacta ggaatgtcgg taacaatggt tagaaagtga ctaacatttg 25380

ttgagcacct gctgtgtgcc cggccctggc cgggagcctt cgtgcccaca gtgaccccgt 25440

ctgcaaatgt agttccttgc cctactcgca ctggggagca ggacgcagag ccgtgcaact 25500 

cacaggtgcc aagctcagga ctccctcctg ggtctgcctg ggctgggctg tgcttgttgc 25560 

ccctgtggcc cacgcatgtg caccttccac ctgaaagcca ggatcttcag gacgctcccc 25620 

gaggaggtcg ttgtctggca caatgatttg tctcttcctg aaaaggtgac agagttacac 25680 

tggagagagc agcatccagg tgcggcaggg acaggcctgg ggctcgcggg cagggactct 25740

gtgtcctgcc ggggtcccac actgcacctg cttgtcagag gcactcagtc aatctttgct 25800 

gatgaaggat gagaggacag aggacgtgat gcttgctgct gcattgcctg cagtcctggg 25860 

tgagatgccc gggttgactc tgctgcccgt cgggtggatg tgatgtcaga tccccggctt 25920

taaaatacga gggagctggg aattgaggga gcaggttggg gcagaaagca cagccccgtg 25980 

gaagcctgga gctgaggcag tgtgggcgac ccctggagca gtgagtgctt ccttcatggc 26040 

cttcatcgca ccctgcagtc ctcatgtagg ggatgccatc catgaattta gttttcccag 26100 

cctcctttaa aaacgcgttc atgctggggc cggggcagtg cagtggctca catctgaaat 26160 

cccaccactt tgggaggccg aggcgggtgg atcatgaggt caggagatcg agaccatcct 26220 

ggctaacaag gtgaaacccc gtctctacta aaaatacaaa aaattagccg ggtgcggtgg 26280 

cgggcgcctg tagtcccagc tactcgggag gctgaggcag gagaatggcg tgaacccggg 26340

aagcggagct tgcagtgagc cgagattgcg ccactgcagt ccgcagtccg gcctgggcga 26400 
cagagcgaga ctccgtctca aaaaaaaaaa aaaaagtaca aaaaaaaaaa aattagtctg 26460

ggtgtggtat cacgcgccta taatctcact actcgagagg ctgaggcgga gaattgcttg 26520

aacccaggag gtagaggttg tagtgagccc gtatcgtacc actgccctcc acctgggcaa 26580 

tagagcgaga ctctgtctca aaaagaaaaa aaaaaaaaga acatttatgc caggtgtggt 26640 

ggctcatgcc tgaaatccca gaactttgga agactgaggc aggaggatca cttgagccca 26700

gaaatttgag agtgfccttcc ctgggcaaca tagagagacc tcatctctac cagaaaaaaa 26760

aaaattagcc cggcatggtg gcatatccct gtggtcccag ctacttaggg ggctgacgtg 26820

gcaggatcac ctgagtctgg aggcagaggt tgaagtgagc tgagatcatg ccactgcact 26880 

ccagcctggg tgacagacag agaccctgtc tcaaaaaaaa aaaaaaaaaa aagcatttac 26940 

tatccaccat ggaaggtgag actgacctgt gagtgattgt tcaaagaaca aaaaataaac 27000 

cccagagata agacaaaagg gtgcctccat gggggtgtga tttaaagctg agaaattggg 27060

cttcttcccc ctcccctctc accccgtggt ttgctaaagg agatgggaaa aaggattctt 27120 

tttttggctg aaatatttaa cactaaatta aagccaattt taacagcact ttggttgatg 27180 

agtgaaatta acagactggc caaaaataaa cgaacggtct gtactatgtg aaaaagaggc 27240

agctttggcc atgctgggcc aatgtgagtt ttcagggttg ctgggaatgt ctgtgaatcg 27300 

gaggaagggc ctagctggga ctctcaggag ccaaggccct gaggggcaac ttgcctggtc 27360

cctgccctga ggcgttcact gctttcttcc tgggccagat cacaggcccg gaggctggac 27420 

cactgggctg gcactcttgc cgagctgctc cctgacttcc tgaccatgct cctttcagca 27480 

gccttgctgc actttagttt ccttgaatga aaaatgggga tgagaatagc tcctacctcc 27540 

aaggtgaatg gagtgagttc ggacaggtga ctccctggga ccagtgcctg gcgcctgaca 27600

aggtccagtc agagcccgca ctgctgttac tgataccctt ggctgtacca ggggagaact 27660

tggttgccat tgccaggtgt tctcccacca cccccactac tgtccctgtt tgatgtgtgg 27720

cgggaataaa gctgtgcaca ttggagcttt tggcacatcc tggctttcag gtgaaaggtg 27780 

cgtgtgtgtt tgagggttta gcctggccaa cccagccatg aggtcggacc tgacctgggg 27840 

gtgagtcctg agctcggcac ccctgagctg tgtggctcac ggcagcattc attgtgtggc 27900 

ttgggccgca cccctttccc tgctgggctg ttgatgttta gactggagcc tctgtgttcg 27960 

cttccaggaa ccaacccgtg tgcggacagg aacggggggt gcagccacct gtgcttctgc 28020

acaccccacg caacccggtg tggctgcccc atcggcctgg agctgctgag tgacatgaag 28080 

acctgcatcg tgcctgaggc cttcttggtc ttcaccagca gagccgccat ccacaggatc 28140

tccctcgaga ccaataacaa cgacgtggcc atcccgctca cgggcgtcaa ggaggcctca 28200 

gccctggact ttgatgtgtc caacaaccac atctactgga cagacgtcag cctgaaggta 28260 

gcgtgggcca gaacgtgcac acaggcagcc tttatgggaa aaccttgcct ctgttcctgc 28320 

ctcaaaggct tcagacactt ttcttaaagc actatcgtat ttattgtaac gcagttcaag 28380

ctaatcaaat atgagcaagc ctatttaaaa aaaaaaaaga tgattataat gagcaagtcc 28440 

ggtagacaca cataagggct tttgtgaaat gcttgtgtga atgtgaaata tttgttgtcc 28500

gttgagcttg acttcagaca ccccacccac tcccttgtcg gtgcccgttt gctcagcaga 28560 

ctctttcttc atttatagtg caaatgtaaa catccaggac aaatacagga agactttttt 28620

tttttttttt tgagacagag tcttactctg ttgcccaggc tggagtaccg tagcgtgagc 28680

tcagctcact gcaacctccg cctcccaggt tcaagcgatt cttctgcctc agcctcctga 28740

gtagctggga ctacagacat gcaccaccac acccagctaa ttttttttat atttttagta 28800

gagacagggt ttcatcatgt tggccaggct ggtcttgaac tcctgacctc aggggaacag 28860 

acggggttgg cctcccaaag ggcggaaata acaggggtga gccaccgttc ccggcctagg 28920 

aaaacttttt gccttctaaa gaagagttta gcaaactagt ctgtgggctg gccttctgat 28980

tctgtaaaga aagtttgatt ggtggctggg tgcggtggct cacacctgta atcccagcac 29040 

tttgggaggc cgaggtgggc agatcacctg aggtcgggag ttcgagacca gcctcaccaa 29100

cgtggagaaa ccccgtctct actaaaaata caaaaaaaaa attaaccggg catggcggcg 29160

cctgcctgta atcgcagcta ctcaggaggc tgaagcagga gaattgcttg aacctgggag 29220

gcggaggttg tggtgagctg agatggcacc attgcactcc agcctgggca acaaaagtga 29280

aactccgtct cagaaaaaaa aaagtttgat tggtgtaacc aaagcgcatt tgtttatgga 29340 

ttgtctgtgg cagcttttgt tctgccgaga tgagttgtga cagatctgta tgggctctaa 29400 

agcctaaaac atgtgccatc cgccccttta cagaaaaagt gtgctgacct ctgttctaaa 29460 

gtattggaca actacaatgt ttgctcattt attattctat gatttgtttt ctgctttttg 29520 

ttgttgttgt tgttgttgag atagggtttc cctctgtcac tcaggctgga gtgcagtggt 29580 

gtaatttcag ctcactgcag cctcgacctc ctgggctcta gtgatcctct catctcagcc 29640 
tccctagtag ctgggactac aggcacacac caccactcct ggctgatttt tttttttttt 29700

tttttttttt gtggagacag ggtttccgca tgttgcccag gctggtttca aactcctagg 29760 

ctcaaacacc cacctcagcc tcccaaagtg ctgggattac aggcgtgagc caccatgccc 29820

agcctattct actgtttgta ttacatagct ttaaaagatt ttttatgact ttaagtcaca 29880 

agggttcttt gtagaaaaaa atatatatat aggaaagtat aaaaagaaag taaaaattgt 29940 

ccataacctc tccagccaga gacgaccgtt gctgacacct cagcatattg cctttaagtc 30000

ttttttctct aagatagcat ttctcttcat cacagtcata tgctacgcag aattctgtat 30060 

cctgattttt tcacttgaca ttacaacagg tatttgatgg cgctgtgaca aactctttgg 30120

cacaatcttt taaatgtatg aaatactcca ctgcacagat gtttgctttt aggcttaact 30180

gttcttttat tttgcgtgtg ctggttacag ccgggcacag tggctcatgc ctgtaatcac 30240

aacactttga gagggtgagg caggaggatc acttgagccc agaagtttga gaccggcctg 30300

ggcaacatag tgagacccca tctctacaaa aaactttttt aataagtcgg gcgtagtggt 30360 

gcatagctgt agtcccagcc accaaggagg ctgagttggg aggattgctt gagccccagg 30420

aggttgatgc tgcagtgacc tgagattact ccactgtact ccaacctgag cgacagagca 30480

agacttgtct ggggaaaaaa aaaaaaaaaa tatatatata tatatatata tatatacata 30540

tatacataca cgcacacaca cataatataa aaatatatat ttataaatat ataatatata 30600

atataaaaat atatatttat aaataaaatt tataaattat atttataagt aaatatataa 30660

tatataatat aaaaatatat attatataat atataataaa atatataata taaaaatata 30720 

tatttataaa taatatataa tacatactta taagtatata tttaaaatat atgtaatgta 30780 

tattttttaa tgtatgatat ataatataca tttataaata cacatttata ttattttata 30840

taaaatatat ataaaatctc caagttgctt tttccaaaaa ggtgtcttgc tgcatttcaa 30900 

acattcattt aaaaacttga atgctggtga tctggtccag aatgtgttca gtagctgctg 30960 

ccagtggcca agcatctcgg gagatgtcta caaaacacgc tggttctggc ctggcgtggt 31020 

ggctcacgcc tgtaatctca gcactttggg aggctgaggc aggtggatca actgaggtct 31080

ggatttcgag accagccttg ccagcttggt gaaaccccat ctctactaat aatacaaaaa 31140

aattagccag gcgtggtggc atgtgcctgt aatcccacct acttgggagg ctaaggctgg 31200

agaatcgctt gaacccaggg ggcagaggtt gcagtgagcc gagatcgcac cattgcactc 31260

caggctgggc aagaagagcg aaactccgtc tcaaaaaaaa aaaaaaagat gctggttcct 31320

aaaatgtggc ccttttcctc ctcacctgct gccagaccat cagccgcgcc ttcatgaacg 31380

ggagctcggt ggagcacgtg gtggagtttg gccttgacta ccccgagggc atggccgttg 31440

actggatggg caagaacctc tactgggccg acactgggac caacagaatc gaagtggcgc 31500

ggctggacgg gcagttccgg caagtcctcg tgtggaggga cttggacaac ccgaggtcgc 31560 

tggccctgga tcccaccaag gggtaagtgt ttgcctgtcc cgtgcgtcct tgtgttcacc 31620

tcgtatgaga cagtgcgggg gtgccaactg ggcaaggtgg caggctgtcc gtgtggccct 31680

cagtgattag agctgtactg atgtcattag ccttgatggt ggccaggact ggtagggccc 31740

tcagaggtca tggagttcct tcgtggagcg ggtgctgagg ctgtatcagg cacagtgctg 31800 

gctgctttca cctgggccgt ctcaccgaag tgtccatgga gcctgcgtag ggtgggtatc 31860 

tgtgtcgatt ttacagatgc agaaacaggc tcagagaaac cgagtgactt ccctaaggtc 31920

acatacccag ttagagcaga gctgggccag gaagtgctgt ctcaggctcc tgaccaggtc 31980

tccttgcttt gcactcttgc caaaaccatg atccagaact gactttgagg tccccggacc 32040 

tcaggctcct ccgaaatggc ctcttggagg ctgctgagcc acagcttagg acccacctcg 32100 

agaggcaaat gtgctttgag ctgccaggcg tcctgggggc cctgccttgg gcacggggtt 32160 

cagacaggcc ccagatgtgt ggggcgtctt tctggacttg agttttcttt tctgtgtggt 32220 

ggacacagtg ctcacccctt aaagcacctg tgatgtgtgc agcagcccaa tccctgcctg 32280 

tcgcctgttc tgctagggaa ggaaggaata cttcaggatg gcaggacaac agaaagaggt 32340

ccaggtttta gagcaagggc aggtcaaact tagaaaattc tggaatgagg atgtgcattt 32400 

cctcttctgg atctgctaaa agaagaggga aggaggggct gctgggggag gagcccagag 32460 

ccgagtttac atccggatcc cgcaaggcct cccctgccct gaggtcttgt tttgtgatgt 32520 

gcttgtgtcc atcctggttt ctgccgtgtc cccaacatcc ggccaagctt aggtggatgt 32580 

tccagcacac actcaccctg tctgtgcacc tgtttttgtg tccgtaagtg ggtatttact 32640 

caccttacga gtgagccact gtgggaattc agggaggtgg cgcagtgacc acccctggag 32700

ggatatgtgt gtggcagggg tcgagggtct cgcccttccc tgcttcctgc gcgtggcttt 32760 

ctccaggacg gggagggctg agctgaagag gtggggacag ttgcgtcccc ccgccaccca 32820

ctgtcctgcg gtgagagcag actcactgag cctgcccttc tcccttgtgc cttccagcta 32880 
catctactgg accgagtggg gcggcaagcc gaggatcgtg cgggccttca tggacgggac 32940

caactgcatg acgctggtgg acaaggtggg ccgggccaac gacctcacca ttgactacgc 33000

tgaccagcgc ctctactgga ccgacctgga caccaacatg atcgagtcgt ccaacatgct 33060

gggtgagggc cgggctgggg ccttctggtc atggagggcg gggcagccgg gcgttggcca 33120

cctcccagcc tcgccgcacg taccctgtgg cctgcaagtt ccccaacctg gcaggagctg 33180 

tggccacacc cacgactgcc cagcagcctc accctctgct gtgggagttg tccccgtcca 33240 

cccctgggtg cctttgctgc agttatgtcg ggagaggctc tggtgacagc tgtttcctgt 33300 

gcacctgctg ggcactaggt cccagctaat ccctgtgcca ggactctaat ttcaccctaa 33360 

cacacatggt ggttttcatt gctggggaag ctgaggcctg agcacatgac ttgccttagg 33420 

tcacatagct ggtgagttca ggatccccca gagataccag ggccagcact cgatccccac 33480 

ccagccctga accccaccat gtgctgggat tgtgctggga gtgtccacac gcctgggacc 33540 

ccagggctgg tgctctcatc tcctttttcc agatcatgag aatgaggctc agggaagttt 33600 

gaaaaaaacc tatcccaagt cacacagcaa caggagcagg atttgaaccc agaaaagggg 33660 

accgcacact ctgttctgct agagtagtta gctgtcctgg gtgatatggc aggtgacagg 33720 

ggcaactgtg cttaacaaag gaacccccat cccccctgcc aagttgggag actagaaggt 33780

caggggcaga agctctgaag ggccaggtgc agtggctgac acctctaatc ccagcacttt 33840

gtgaggccaa ggcgggcaga tgatttgagc ccaggagttc aagatcagcc tgggtaatgt 33900

agtgagacgc catctctaca aaaaaatttt ttaaaaatta gctgggcatg gtggttcatg 33960 

cctgtagtcc aagctacttg ggaggctcag gtgggaggat tgcttgagcc caggaggttg 34020 

aggttgtggt gagctgtgat catgccactg cactccagcc tgggcaatag agtgagaccg 34080 

tctccaaaaa aaaaaaaaga agaagaaaaa gaagctctga ggctccaagt ccccaggcac 34140 

cccttggctt gagggcagac aagggaggag agggtcacct gggcagccct gacttttgtc 34200 

ccctggcaaa gggaccttca gtgaccttgg ccctaggaga gcctctgagc acgtcagcca 34260 

tgtcgaaccg ctcaggaagg gcagcaagaa tttggcttct gacctctgcc tctcctactc 34320 

gccatctgca ctgggtgtgg ttgtgcccat tttacagatg aggaggctgg ggcatcgacc 34380 

agctgaatgc cttgtcccag gtactgcgta ggcagagctg gcagttgaac cccgtgtcct 34440 

ggttgtcgct gggggtgggc tgcaccctga cttgtgaggc cagtagcaag gtttgcacgt 34500 

gacttcgtga ccgtcaccca gctctgcagc acatcccgtg acccagctca tccaggccgc 34560 

atgcaaacct gttgccaggc gagaaaccag tcaccgcaca gctgtggttg cctgaaatga 34620

ttaagctcat taatcacccc ggagtgagga cagactcaga tgaaaaccag caaaagccct 34680 

ggaaactcat gtgaccctgc caatgagggc ggccatgtgc attgcagcct ggccgtcact 34740

cctcggtacg tgttttggac ttaaacgctc cggatgttta ctgagtgctt gattaataac 34800 

atggaaggcc tggtctcatt gctgtgggag tgaaggatgc acagccaggc ctgacatgat 34860 

gagaacaaga acctggagtc tcgctgcctg ggtggtaatc ctggccctgc cacttagcaa 34920

ctgtgtgact gtagccaggt cacttaattt tgctagatcc tgcctgcgct tcagtggatc 34980 

ttgctggttt tccaaggtgg ccaaacactt taaggcattc atgtggtcgc taggctgcag 35040

ggttgaaccc tggctcaccc cgcagggcgc cgtgtgctct gtggcctggc tgtgcctttg 35100 

ctgacaccgt gcccgtgtgt gttcatgcag gtcaggagcg ggtcgtgatt gccgacgatc 35160 

tcccgcaccc gttcggtctg acgcagtaca gcgattatat ctactggaca gactggaatc 35220 

tgcacagcat tgagcgggcc gacaagacta gcggccggaa ccgcaccctc atccagggcc 35280 

acctggactt cgtgatggac atcctggtgt tccactcctc ccgccaggat ggcctcaatg 35340 

actgtatgca caacaacggg cagtgtgggc agctgtgcct tgccatcccc ggcggccacc 35400

gctgcggctg cgcctcacac tacaccctgg accccagcag ccgcaactgc agccgtaagt 35460 

gcctcatggt cccccgcacc tcactccctc gttagatcag gctggttctg ggagctgacg 35520 

ctgaaaggag cttctcatct ggggttcctg ggtgtacata gatggttggg taggttgtgc 35580 

actgcacaag ctgcatgatg ctacctgggg gtccaggtcc aggctggatg gacttgttgc 35640 

ttcatcagga catagataaa tggccaaaac tcctcagctg gaaggtcctg ggcaggatct 35700 

ttgggtgtga aaaccagtca caggggaagg gtgcttgctc atactgccag cacagtgctg 35760 

agtgctttcc atagcgctcg tttactcctc aagcctggag ggtggggagt agcatggtcc 35820 

catttcacgt acaaggaacc cgatgcacag agaggtgtgg caacccatcc aaggccatac 35880 

aactggggtg ggttgagccg gggttgactg tggcaggctg gctcaagagt ccctgctcct 35940 

gaacccttgc caggcagcct ggcatcagct cggggaattt ttgccctgac ccttggaagc 36000 

aagtgggcct ctttgttctc atgtcagtga tgagaagagt gactttccta tggcccctct 36060

ggagtacagg tgtttcctgt tggcgggctc ttcccccatg acatcagcag cgagctggtt 36120
atgattccct acgcagaact tgatagttta taaagctctt tgtcatccag gccccgttgg 36180

agtctcacgc agacctggtc gcaggcgggg ctggtcttgc ctgtcccagc tgcatggatg 36240

gggaacttga ggcttgcaaa ggttaagggg ctgttcgagg cccacgctgg caggagatgg 36300 

gcctgggcca gagtctggga cttcccatgc ctgggctgtc tttggtcctg ttgctcacca 36360 

tccctccctg gggccatgac cttagagagc caaatggagg tgcaggtaac ccacggcaag 36420

gaggggttgc catgactcag agtccccgtc ctgtggccgg cagtacctgg tgcaacgact 36480 

tggatttcag accagccact gtagcccgct gacggtgcgc tcgaagtgcc acagcttctg 36540 

aagccaggca ggactcaggc caggagactc tgttagctgt tgagagggag aggccaacgg 36600 

atgttctggt tctgctagag agctggttct tcggatcctg gtaccagtgc actgagagga 36660 

ggcccagctt gattctgggg ctgccttgtg gtggcatgtg ctgctcactg acaccctcga 36720

ggagtgtctt ctctcgggct tgttgactgt gcccggtttt ccgcagttca ctggtgcaca 36780 

cataggcaca tagcaaaccg cacacacagt cgtgggtatg agtttcacta cattccacca 36840

ccagtgttca ctaccattac ctgccttccg tcttaagtgt tcatcattta aaaataaatt 36900 

tattgggctg gacgcggtgg ctcatgactg ttatcccagc actttgggag gctgaggcgg 36960

gcagatcacc tgaggtcagg agttcaagac cagcctggcc aatatggtga aactccatct 37020

ctactaaaaa tacaaaatta gctgggcatg gtggggcatg cctataatcc cagctactca 37080 

ggaggctgag gcaggagaat ggcgtgaacc cgagaggcag agcttacagt gagcccagat 37140 

agcaccactg cagtccagcg tgggcaacag tgcgagactc catctcaaaa aaaaaataaa 37200 

taaataaaag aaaaataaat ttatgatcta tttcaaaaat aacacatgta ctttgaaaca 37260 

gcagagacac atatgacacg gagaatgaaa ttccccatag cgcaccccca agagacagcc 37320 

ctggtccccc cgtctttccc gtggacctcc agcggggcag atgctgagcc gcctgttgtc 37380 

gagtggcatg ctatcccgtc ctccagctcc tctgtggctt acagacaccc acctgcagcc 37440

ctgtctttgc ctcctctagc gcccaccacc ttcttgctgt tcagccagaa atctgccatc 37500 

agtcggatga tcccggacga ccagcacagc ccggatctca tcctgcccct gcatggactg 37560

aggaacgtca aagccatcga ctatgaccca ctggacaagt tcatctactg ggtggatggg 37620 

cgccagaaca tcaagcgagc caaggacgac gggacccagg caggtgccct gtgggaaggg 37680 

tgcggggtgt gcttcccaag gcgctcctct tgctggtttc caggctgctg cccctgtcct 37740 

tagcagaggg aggaaacaga ggatggctct gggtgaatga tgacttgggc ttcgattatg 37800 

tagtcacagg gtatgaccct gagatgcgtg gaaccccgag actgtgatta tatgtagaaa 37860 

ctgggtttcc ccgttgttta agtagtcatg gtggggtcag accccacagg acttttgtct 37920 

tttcaagaaa gaaaatggtc gtgtgtcatg caggggtagt tggtactggt taatccaggt 37980

ttatccttta ttttgtggga actgtacagt catttctgct acaatgctgt atatgctctt 38040 

ctgaaagaca cctatgcaaa atcgcacagt aaaaatgaca caactcatag ggaaagcggg 38100

gccagggcac agccctcaaa atctccatca atgacatgta agaaaagaga ggaacctggg 38160 

aaatagcaaa gtgccttttg cacattaaat ggttagctat atcccacaat actgtgcatt 38220

cgtaaacgtt aatgctgcaa taaatacggc acttcacctt gggaagatct ggagttggct 38280

tatgagtgtg gaagggtgta gcgcatgagt ttttgtgaaa cactggaagg aggattgtgg 38340 

gaaatcaaat ggaaagttct caccccaggc gtggagaaga gtgggtcatg gccccagcag 38400

tgagcccagg gaggtcagag acggaggtgt gtgtgtgggt gtgaccctgc gcagttccct 38460

gccggctgta gttttttgca ttcgcttaat gtttctcgtg gaggaaattg tgcatgagca 38520

aatgtgaaac cgtgctgtgc tcaaattgtc ctaatacatc attgcattgg aacagattgg 38580

ctttnttttt tttttttttt tttttttttt tttgaaatgg agtctcactc tgtcaccagc 38640

ctggagtgca gtggcatgat cttggctcac tgcaaccttt gcctcctatg ttcaagtgat 38700

tttcctgcct cagcctcctg agtaactggg attacagggc atgagccacc gcggccggcc 38760

agatttgcat ttttgaaaca actgctaggc tgggcgcggt ggctcacacc tgtaatccca 38820

gcactgtggg aggccgaggc aggtggatca cctgaggtca ggggttcgag accagcctgg 38880

ccaacatggt gaaaccccgt ctctactgaa tatacaaaaa tcagctgggt gtggtggcgg 38940

gtgcctgtaa tcccagctac tcaggaggct gaggcaggag aattgcttga acccaggagg 39000

cagaggttgc ggtgagccga gatcacacca ttgcactcca gcctgggcaa caagagcaaa 39060 

actccatctc aaaaaataaa aaatagaaaa acaagtgctg tagcggaagt gagcactttg 39120 

cggagtcagg cttgtgtggc ctgttccaca aatgatgtgc tcacggtggc ctcaggccca 39180 

cctggagtct gcagcatggg gcacaacagg ttcattagtg tagaattcca ggacaggcct 39240

ggctcctaag cagccttctt ttacaaaaac tgcagagccc gcctgtatcg tagcactttg 39300

ggaggccgaa gtgggtggat cacgaggtca ggagttcaag accagcctgg ccaacatggt 39360
gaaaccccat ctctactaaa tatacgaaaa ttagctgggt gtggtggcac gcgcctgtag 39420

tcccagctac tcgggaggct gaggcagaat tgcttgaacc tgggaggtgg aggttgcagg 39480

gatctgagac catgtcattg cactccagcc tgggcaacag agcgagacgc catctcaaaa 39540

aaaaaaaacc tacagagcca cacggcctct ttctccaccg agtgttggtg tgggagcttg 39600

tgttattgtg gtgaaatctt ggtactttct tgaggcagag agaggctgag cgcctggaga 39660

gactttcaca tgggtcgcca tgtccgccgt cggtttcgct gttgtgctcc ccatctgaag 39720

gctggtgccg tccagacagg ctggacgccc ctttccacca gatccttcct cccgcagcag 39780 

tttctagtta cgttgtactg tgaggtctgt gtccttggtt gatggcaaaa gtcagccgaa 39840

ttgaaattca gagccatgcc tggctccctg gagcttctct cctgggcagc tgtgatcatt 39900 

gcctctgctg tggtgtgggt ggtggaaatg gattcctttc atcttgcttg ctacaggtga 39960 

ctgtcacgtg gagtcctttg gagagaggga cgtgttaatt gatggatgtg gctcccatgc 40020

tgagaaagct cctgggcgta cattgcctta gagtttcatt ggagctgcgt tcttttatgg 40080

tgtctgctag gcagaagtga tgaagacttg gaagaaaacc cagaaggttt tccacttaat 40140 

ttggaaaatg tgcttttccc ctcctgtgtc ttttgctaag gtccagcctc ctgcagcctc 40200 

cccgctctgt ggactctggc tttgattctt tattaggagt ccccctgctc ccccaaaaga 40260 

tggtgtctaa attatcatcc aattggccga ggttttgttt tctattaatt gtttttattt 40320

tttattgtgg taaatttata taacataaaa tttgccattt taattgtttt gttattgttg 40380 

tttttgagac agggtctcac cccagtgccc aggctggagt gcagtggtgc gatcatggct 40440 

cactgcagcc tcagcctcca gggctccagt gatcctctca cctcagcctc tctagtagcc 40500 

gggactacag gcatacacta ccacatctgg ctgatttttt gtattttttt tttattgtag 40560 

agacccgcta tgttgcccag gctggtctca actcctggac tcaagccatc ctcccacctc 40620

accctcccaa agtgctggga ttacaggcat gagccacaac acccagccat tttaattttt 40680 

tttttttttt ttgagatgga gtctcactct atcgcccagg ctggagtgca gtggcgtggt 40740 

atcaactcac tgcaacctct gcctcccagg ttcaagcgac tctcctgcct cagcctcctc 40800 

ccgagtagct gggattacag gtgcccatca ctatgcctgg ctaatttttg tattttttag 40860 

cagagacggg gtttcaccat gttggccagg ctggtcttga actcctaacc tggtgatccg 40920 

cccgcctcgg cctcccaaaa tgctgagatt acaggtgtga gccaccgtgc ccggcctttt 40980 

tttgtttttg agacagggtc ttgccctgtc acccagactg gagtgcaatg gtgggctctt 41040 

ggctcactgc agcctccgcc tcccaggctc aagttgtgca cctccacacc tggctaactg 41100 

tattttatgt agagacagat ttcaccatgt tgcccaggct gggcttgaaa tggactcaag 41160 

cagtccaccc acctcagcct cccaaagtgc tgagattaca ggcgcgagcc accgcaccca 41220 

gcccatttta cctattctgc agttgacagt tcagtggcat tcagtcagtt cacgaggtaa 41280 

ccatcactgc cattcatctc cagactactt caccttctcg gcagatgtcc gaaactgtcc 41340 

gcattgaaca cactcctcat ctccctctga cagccaccat tctactttgt atctctctct 41400 

gccttctcta ggtacctcat gtaagtggaa ttataccaat atttgccctt gtgtgactgg 41460 

cttctttcat gtgacatggt gtcctcaagg ttcatctgtg ttatagcctg tgtcagaatt 41520 

tccttcctta aagcctgaat aataacccgt tgtaaaggct gggcgcggtg gctcacaccc 41580

tctaatccca gcattttggg agtccgaggt gggcagatca cttgaggtca ggagtttgag 41640 

accagcctgg ccaacatagt gaaaccctgg ctctactaaa agtacaaaat tagctgggtg 41700 

tggtggcgcg cacctgtaat cccagttact caggaggctg aggcaggaga atcgcttgta 41760 

cccgggaggc agaggttgca atgaaccaag attgtgcctc tgcagtccag cctgggtaac 41820 

agagtgagac ttcctgtctc aaaaaaaaaa aaaatcatcg gatggatgga cggaccactt 41880 

cttgttattt atccatccac gggtgctagg tttcttccac ctttggttgt cgtgaataag 41940 

gccactatga acatttcctt ccgtggtgaa ggttttgtac tagtgaggaa aaggcgtgtt 42000 

tgtggtgttg cataggattc tggtaagaaa gtttgcacta accataagta tttgtactac 42060

attaaaatga aagctcaggg gccgggcgcg gtggctcacg cctgtaatcc cagcactttg 42120

ggaggccagg gcgggcggat catgaggtca ggagatcaag accatcctgg ccaacatggt 42180 

gaaaccccgt ctctactaaa aataccaaaa aactagccag gtgtggtggc gggcacctgt 42240 

agtcccagct acttgggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt 42300

gcggtgagcc gagatcgctt cactgcactc gagcctgggc aacagagcaa gactccgtct 42360 

cacgcaaaac tctgtctcac gcaagactcc gtctcaaaaa aaaaaagagt tcagggttta 42420

tgaaactggc cagccgcgta aagtttgctg tgttgttttt gtgcccggga ggagtgtggc 42480

cagggtgtca cgtcacacag tacacgtttc tcagatggtg gttctccaga ctgctgtccc 42540 

aaagtctgtt tttgcatctg gttcccacag acccaccctc cacggtgagc ctgattttgg 42600 
ccagggtagc tggaatcttg cttgtctttc agcccggcag ctgtaccagt ccagggtcca 42660

cagctagtgg cttttaggaa ggaatttgtt cagttggctt tgacacatgg ccccctaggg 42720 

tccacagctc tgtagtgatg tggatgttgt tatctacaaa gacacatgat ccttcgtgtc 42780 

cagatgaaag tgatgatgtc tttgcagctg cccagcaagg ctgtgtgtgt gtgtgtgtgt 42840

gtgtgtgtgt gtgtgtgtgg tgtgtgtgtg gtgtgtgtgt gtgtatgggg gagggaggca 42900 

ccctttccat ctgggggtgt gtgtgtgtgg ggtgtgtgtg tgtgtgtgcg cgtgtgtgtg 42960 

gtgtgtggtg tgtgtgtgtg tatgggggag gcaccctttc catctgggtc caagagactg 43020 

ggcctgggga agacgcttct ttttatctac ttagagactt tgttttattt gtattttttt 43080 

gagacagggt ctcactctgt cacccaggct ggggtatggt gatatgagca tagctcactg 43140 

cagcctcggc ctcccaggct gaagcgatcc tcccacctca gccttctgaa tagctgggac 43200 

tgtaggcgtg cgtcaccata ctgagctatt gttttttttg tttggttggt ttaatttttt 43260 

ttgatacaga tggagtcttg ctatgttgcc cagactagtc tcaaactcct gaactcaagt 43320

gattctccca cctcagtttc ccgacattct gggatcacag gtgtgagcca ctgctgtctc 43380

cctgttttat taactgctga aagacctaga taaagaaagt ctgaaaagac ttactatcag 43440 

agcaccatcc taagatgatt ccctctgact caatggagag ggaggggagc ttttccttca 43500 

ggcctgggtg gcaggagccc aggtgctcca ggccccattt gccccaggcc aaatcactcg 43560 

ggaacttgga tgcagctgtc tttcagggta acccaaagga accagatccc cgcaggcagt 43620 

aggcttctgg gctgtcctct cctcctacgt cagctcagta agagcccttc gaagggatgc 43680 

tgtgtcggag gccccaaaag cccaggctca tccctgagat gcacagggtg ggctgggctt 43740 

aggcagcgct cgagcatctc ctggacggtg accccagaga gtgtggagac ggagagtcct 43800 

tgagagtcac tgagagacgt ggctgccctg ccttcccaag aggggctctg agtcattccc 43860

cacactcacc tgcccctacc caccctcacc tggcccccag cctcacctac ccccacatct 43920 

gtaccgatcc ctttacccgc accttcccta cccaccctca cctcccctgt accttcacct 43980 

cccccactca cccgcccctg caccctcacc tgtcccccac cttcacctaa cccccaccct 44040 

cacctgccct cccctcacct ggcctccttc cgttggggaa ggggttgtaa ggggcggccc 44100 

ccaaactgtc tgtcctggtg ccctgcagag aaaacagtac gtgagggccg cagtccaaaa 44160 

gcttgagtcc tggaaggtgg aggagacagg gatgtgttgg gaagggcccc atggtcttgg 44220

atcccttctc gactgtcaat ggggccttca tgggagcgcc agtctagtga tgcacagctg 44280

ggtgcccggc gggtggctga ggaggcctaa agtccgaggc ggcaagagct cttccagagg 44340 

ctgttgtcct aatcgctctg gcatactcag gcgggcacgt agttaggagc tgattggaga 44400 
ggagagaccc ccacaccaat actgggattt gactttcagg ctaaacttga gaagtgtggc 44460

ctctgctgtc ctgccagagc tctccagcca gtgcccaggg ctctccagcc agtgcccggg 44520

ggtctccacc agtgcccggg ggtctccgcc agtgccaggg gtctccgcca gtgcccaggg 44580 

gtctccgcca gtgctcagga gtcttggttt ctttgtctta cagccctttg ttttgacctc 44640 

tctgagccaa ggccaaaacc cagacaggca gccccacgac ctcagcatcg acatctacag 44700 

ccggacactg ttctggacgt gcgaggccac caataccatc aacgtccaca ggctgagcgg 44760 

ggaagccatg ggggtggtgc tgcgtgggga ccgcgacaag cccagggcca tcgtcgtcaa 44820

cgcggagcga gggtaggagg ccaacgggtg ggtgggggtg ctgcccgtcc aggcgtgccc 44880

gccgtgtctt ctgccgaatg ccagcctctc acaggctggg gagactttcc accctgggga 44940 

tccaatgggt ggctttccag ggtcccaaaa gcaaacacag gctctttcac agcccctcca 45000 

ggaaagcaga aagccccaag ggctggaagg gaagggggag ctctgctgag aggttacaag 45060 

gcagcgctgg ccgacgggag ttgcagttga taggttttgt atcatccttg ttaaacttga 45120 

accctgtgca gaaatccctt ccacggcatg ggggctgcct gttgactcgc tcctgttcca 45180 

ccacagggag ctcctgggct tcttcctccc agaggccccc gacgctccca cctgttggtc 45240 

gtcagagctt ctggttggtg ggaaggcacc caggaccttg aggtctccag agagaaaagc 45300 

cagggaaaga gggagaccga aacccatgtg acatgaaact caggctccaa actgagcacg 45360 

ggaacgtttg gggacaggag cgcgatggcc ttcctcagat agctgggggg ctggcatgaa 45420

gacgggagct acagccagca caggtcctgg gccgggagcc cagagattga gccctgactc 45480 

tgtcacttac tggccacgtg accttgggcg ggtggcatag cctcttggag actcagtttc 45540 

ctcattggta ggagtgacgg ccacagtggt gcggcctctg cagcacacgg ggggctcggt 45600 

gggcggaagc cccgggtcta taaggcggct gtgcaggagc cagccgagct ggtctcccaa 45660 

cagccagggc tccggggtcc ttagcagctg tggggggcct gcacctgttt cccatggctg 45720 

ctgtcagaaa ttaccagaag ccaggtggct gagagtaatg gacacttgtt ctctcacagt 45780 

tcctgagggc tgaagcccga gatcgaggtg tgggcagggc cctgcgccct ctgaaggctc 45840 
tgagggaacc tttgggcttc tggtggctcc aggcacccct tgacttgtgg tcctgtcact 45900

ccagtctctc tgtctggctg cacatggcgt ggcctcttct gtaccattga aggacacttc 45960 

agttggattt agggcctacc ctcacccatt gtggtcgtat cttgatcctt catgacattt 46020

gtaaagaccc tgcttccaaa taagctcaca ttctgaggtt ctggggtgag cgggaatttg 46080 

gagagcattg ttcaactagt atagaatgtg acctgtcagc ctcgggcagc cctgagaggc 46140 

aggggctttc cacagcccag ctgggtgccc tgggctccgt gctgtccgag gagacgccat 46200 

ccccacaccc gtccttcacc cgccaccctc ccgcaggtac ctgtacttca ccaacatgca 46260 

ggaccgggca gccaagatcg aacgcgcagc cctggacggc accgagcgcg aggtcctctt 46320

caccaccggc ctcatccgcc ctgtggccct ggtggtggac aacacactgg gcaagctgtt 46380 

ctgggtggac gcggacctga agcgcattga gagctgtgac ctgtcaggta cgcgccccgg 46440 

ggcctgccct aaccgcagac acccggcctt cattgtcagt aatggcagca gctgccacat 46500 

tgtccgagac ctgccgtgag cccagtgccg cgccaggggc tttgtgtgta gcgtgttttg 46560

tcctcacact gacagctgta ggctggggtt ctgagtgagc cccacagggc agaggcagaa 46620

aatgagtctc agagagggtg agcgagctgc ttggggcccc acagcaggag atggagcagg 46680 

actgcagcct agcctctgcc cccagcacct gcgcaagaag ctgctctgct ctggactgtg 46740

ttaggctgcg agggctggag agaaatgaga gttggtgctt agagaggggg cgcaggtccc 46800

catggctttt cctcttatga tgaggtagat gggtgaaggg aggggccatg cttgcagggg 46860 

ccagtgaccg aggcccgccg ttggaactga tggccttcat cccgagccca gcccaggtgg 46920

gagcagggct ttccgagggc ttgtcttggg tcggcctgct tccagggact ctgctgcagc 46980

tcccacccct gtccaaagca tggaatcccc caggctccct ggcagtcctg tcaacctctg 47040 

tcctcccaag ctgagtgtgg ggcaagttct ggaggtcagc actgctcagg ggggcccacg 47100 

ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 47160 

gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 47220

gtgtggagaa gaccaccggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 47280

ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggctggcag 47340

tggggtgggc agggtggcct ctaaacccga cccctggagg aggctggagg ccagtgcaag 47400 

atcctgtgtg gcctcagcca ggcggtggtc tctgccagat gccaactgtt gcccgctggg 47460 

gttcagcgac atgtccgaat gtcccgaggc ctctgaggtt gttttctttt gccgcagaac 47520 

aaatcaccac gaacagcgtt ttaagacaac accaactctt tttttttttt ttttttttga 47580 

gtcaggatct tgctctgttg cccaggctgg ggtgccctgg tgcaaacaca gttcactgca 47640 

gcctcgacct ctgggcttaa ttaagtgaac accttgcctc agcctcccag gtagctggga 47700

ctacaggtgg gcaccaccac acctggctaa tttttttttg tagagacggg gtttccccat 47760 

gttgcccagg ctggtctgca actcctgggc acaagctatc tgcctgctgt ggcctcccaa 47820

agtgctagga ttataggtgt gagccactgg cctgacaaca cccacggatt gtctctcagt 47880 

tctgtaaggc aaagtccagg cacagcgtgg ctcacctggg ttctctgctc agggtctcac 47940 

ggggccagaa tcaaggtgtc aggaacgctg ggccctcagc ggaggctctg tggagaaatt 48000

agcttccttg ctcactcagc aggtagcagt tgtgggatcg aggttctgtt ttctctctgg 48060 

ttattggtcg gggaccactc tcagctccta gaggccaccc caggtccttg ccccgtggcc 48120

ctctctgcct cagcagtggg ggctccctgc gtcagtccct cccgcacctt gagtctctct 48180 

gatttgcttc taaagggccc tgtgattcgg ctcagccacc tttagattag gttagcctcc 48240

cctttgatag actccaagtc ggctgattaa taaccttact cacatctgca gaatcccttc 48300 

tgccacataa ggtcatgacg ccgtgctggg gactggggtg ggaaattacg gggtcattta 48360 

ggattctgcc tgccactgcc ttgctgtgtc ccagggcttg ggggaggggc ctccacagct 48420

gggaccacag tccttcctcc cctccatggt aaccatctga ggattacttg agaccagcct 48480

gggcaacatg gtgagaaccc atccctacaa aaaatacaaa caaaaaggga ccaggctggg 48540 

cttggtggct catgcctata atcccagcac tttgggagac caaggtgggc tgatcacttg 48600 

aggttgggag ttcgagacca gcctgcccaa catagtgaaa tcccgtctct actaaaaata 48660

caaaaattag ctgggtgtgg tggcaggcgc ctgtattccc agctactggg gaggctgagg 48720 

tgggagaatt acttgaacct gggaggcgga agttgcagtg agccaaaatt acgccactgc 48780 

actccagcct aggcaataga gtgagactcc gtctcaaaaa aaaaaaaggg ccaggggtgg 48840

tagtgacaaa gagaccctat cccaaaaaaa ccgaacactg aatccttgag actgagtaag 48900 

gacactgtga aatttttctg ggtggggcag ggaacagagc gtcttctgtc atttcttcca 48960 

cctgggtgtg gtcagctctc cctccaagct gcctcctctt cttctcattg tccgggtgtt 49020

ggacacattt ggttaactgg atagaataac gcgagttccc agggacttgg tccatttgct 49080
attttatttt attttatttt attttatttt atttatttat ttatttattt atttatttat 49140

tgagatggag tttcgttttt gtcgcccagg ctggagtgca gtggcgcgat ctcggttcac 49200 

tgcaacctct gcctcccagg ttcaagtgat tctcctacct cagccttcca agtaactggg 49260 

attacaggca cccaccacca taccaggcta atttttttgt atttttagta gagacgggtt 49320

ttcgccattt tgcccaggct ggtcttcaac tcctagcctc aggtgatcca cgcacctcgg 49380

cctcccaaag tgctgggatt acaggcatga gccaccacgc ctggcaccat ttgctatttt 49440 

aattcccatg tgtattagtg tcccacggct gctgtaacaa atgaccacaa actggatggc 49500 

ttaaagcaac agaaatggat tcccccaatg tgctggagac cagaagcctg cgaccaaact 49560 

gttgggaggg ctgtgcttcc tctgggggct ccagggagga tctatttgtt ggcccttcca 49620

gtgctgtggg tgccagcgtt ccacacttgt ggatgcgccg cctcaacctc tgcccatctt 49680 

catgtgtcca tctcctttgt gtctgcgtct ttacctcttc ttcttgtctg tgttgcctct 49740 

tataaggacg tttgtcattg ggtttagggc ccacccaaat catccgagat gacctcgtct 49800 

tgagatcctt aacctgcaaa gacccttttt ccaaaaaaag gttatgctca cagattctag 49860 

gccttaagac atgggtgtat ctttctgggg ggcactatcc aaccccttat acaatgaaag 49920 

acgggaagag ggccaggtgt ggtagttcac gcctgtaatc tcagcacttt aggaagctga 49980

agcgggagga tcacttgagc ccaggagttt acaagtagct aggcaacatg atgagacccc 50040 

atttctacaa aaagtaaaaa aaaaaaaaaa aaaaaaaaag ccaggtgtgg tggctcacac 50100

ctgtaatccc agcactttgg gaggctgagg caggcagatc acgaggtcag gagattgaga 50160 

ccatcctggc taacacggtg aaaccccgtc tctactaaaa atacaaaaaa ttatggccgg 50220

gcgcagtggc tcccgcctgt aatcccagca ctttgggagg ccgaggtggg tgaattacaa 50280

ggtcaagaga tcgagaccat cttggctaac acggtgaaac cccatcaaga tcacaaggtc 50340 

aagagatgga gaccatcctg gctaacacgg tgaaaccccg tctctactaa aaatacaaaa 50400 

aattagccgg gcatggtagc gggcgcctgt agtcccagct gctcgggagg ctgaggcagg 50460 

agaatggcgt gaacccggga ggcggagctt gcggtgagcc gagatcgctc catgccattg 50520

cactccagcc tgggtgacag agtgagactc cgtctcaaaa aaaaaaaaaa aaagaaaatt 50580 

agccaggcac agtggcaggt gcctattgtc ccagctactt gggaggctaa ggcaggagaa 50640 

tggcatgaac ccgggaggtg gagtttgcag tgagccgaga tcatgccact gcgctccagc 50700 

ctgggcgata gagcaagact ctgtctcaaa aaaaaaagcc aggcatggtg gtgcatgcct 50760 

gtagtcccag ctactcaaga ggctgaggca ggagggttgt tcgacccacg gagatcaagg 50820

ctacagtgag ccatgatcgc accactgccc tccagcctgg gtgacagagt gtgaccctgt 50880 

ctcaaagtaa gtaaatagga ggagagacaa gtgggcagtt cagactgatg gtatgggcac 50940 

agtagagact ggtgcagaca ggctggcctg tgatgtcaag caacttctgt aactgtttcc 51000 

ggcatccatt tgtgtgtcaa tttccgtgtc agtaggaaga ctctgtaggc tgccaagagg 51060 

aataagtggg aggatcctcc cagagaggcc gggcctgcag gagggccagt tctcatgagt 51120

tcttatttgg cccctaccct ccaggctgtg gttctgaggt gggagacaga gcctgacctc 51180 

tgtttgtctt gttttgtctt tgcagcagcc cacccatgtg cccgtgacaa tggtggctgc 51240 

tcccacatct gtattgccaa gggtgatggg acaccacggt gctcatgccc agtccacctc 51300 

gtgctcctgc agaacctgct gacctgtgga ggtaggtgtg acctaggtgc tcctttgggg 51360 

tgatggacag gtacctgatt ctctgcctgc taggctgctg cctggcatcc ttttaaaatc 51420 

acagtccctg tggcatccag tttccaaagc tgattgtgtc ttcctttgcc ctcctttctt 51480 

ttctactatg tgcattcggt gctatgaatt ttcctctaag tactgcgttt cctgcatctc 51540 

acaaattttg ttacattttc attttcaggt agtttgaata tttttacact tctcctgaga 51600 

tgacatcttt ggctcatgtg ttatttagaa gtgttgctta gtttctaaag agttggggct 51660 

tttccagctg tctctctgca actgatttct aatttaattc tactgtagtc tgagagctta 51720 

ttttatatga tttctgttat tttaaatgtg ttgggtgtgg tgtttttgtt gttattgttt 51780 

ttgtgtcttt ttgttttgtt ttgcttcgtt tgttttgttt ttgagacagt gtcttgctct 51840 

gtcactcagg ctggagtgca atggcgcgat ctcagctcac cgcaacctct gcctcccggg 51900

ttcaagtgat cctcttgcct cagcctcctg agtagctggg attacaggtg cacgccacca 51960 

tacccagcta atttttgtat ttttagtaga gacggggttt caccatgttg gtcaggctgg 52020

tctcgaactc ctgacctcgt gatccgccca cctcggcctc ccaaagtgct gggattatag 52080

gcgtgagcca ctgtgcctgg ccattaggtg tgttttatca cccagcatca tgcagtttat 52140

cttggtgaat gttctgtgta ctcttgaaaa gaatgtggat tctgctgttg ttgggtggag 52200

tgttccagaa acatcaatta gatccagttg gttaatagtg ctcatcaggt tgtctctatc 52260 

cttccttcct gactgcctgc ttgagctgtc agttattgac aggggtgtgg agtctccaac 52320
tctaatggtg gatttgttta tttctcctag tagttctatc tttttctctc cttctaccct 52380

tgatcctctt ctccccctag ggcttcctgg tgttggtggt gggagagtgg ggtagtgaag 52440

aacctggact ttagggccaa agaggccagg gttcaaatcc tggctctgtc acttcccagt 52500 

tgagtgaccc tggctggtgc ctgaatctct gtgagcctcc acttcctcct ctgtgaaatt 52560 

gagagcactt acctggcagg ctgtcatggg catcaagtaa cagggcactc cacctggacc 52620 

ctgacacgtg atgcacagga atgccagctg ctatgccatg ggtgtggcag tagtaataaa 52680 

gtgaccatct gtatcctcac cacagtgaag cctgtccagg gctttctctc ctatgccccc 52740

atgcctccag gtggccttgg atcctgttgg ttctgtgctc tgctcagcga cctttctccc 52800 

gtgggagttc ctgggggttc agcttcatcc tacagacagc agcacacact ggctgtgcac 52860 

cctttttttt tttttttttt ttttttttga gatggagtct cgcttttttc gcgcaggctg 52920

aagtgcagtg gtgtgatctt ggctcactgc aacctctacc tcctgggttc aagtgatttt 52980 

cctgcctcac cctcccaagt agctgggatt acaggctccc accaccacgc ccggctaatt 53040 

tttgtatttt cagtagagat ggtgtttcac catgttggcc aggatggtct tgaactcctg 53100 

acctcaggtg atccgcccac ctcagcctcc caaagtgcag ggattacagg cgtgagccac 53160 

cacacccgga gtgccggttg tttttagcag tttgtcttgt tcctggagag actggctcct 53220

gcccaggagc tcggggagta gggccgcggg gtgctgcctc acacctcgag tttggccgta 53280

agcagagggg acattttgtg actgtccccc tcctgagctt cccagcagct tttctccaag 53340 

ttacagccca aaagctcagg tggatttgca acccaacggt gtctgtgcac ctcccactga 53400 

tgcccgaact gccctggcca agaaacgggg ccgtcagaac gctgcactaa ctgcagcctt 53460 

gggcctccat gccagaggcc atgcccttcc atccaccacc ccctggcctg ggccctggcc 53520 

ctcctggctc gggaactcca ggccccttcc tcacggatcg agagacgtgt atttaccgca 53580 

caggtgcttg tcattctctt gtggcctctt ctccagggag atcacagaag gacagggcct 53640 

cactgaggtc tcggacatgg accctttgat agtggcagga gccaggctgg gcaagaggcg 53700 

gccacagtca cctcagcagt gccatcacca ccgccattca gcccttccct gagccgggcg 53760 

cgcccctggc tctggcccca gtgtcccagt tacagctcac aggagcttgt ggtgcccagc 53820

ggctgcttct gattgagagt cgaggtcgga ggctttggga ggctgagagg ctgctcggtt 53880 

tcacaactgc tgagggagac ttgggctcca tctcaggtct gccccatgtc gccctcaacc 53940

tccagccacc ggtcctccgt gtcccccatg gccaggcacg gcttgcagac atctgtcgtt 54000 

ggctcctctc agccgtcgtg ggctgaccct ggcacgtcct cctgtggctg agcccagtgg 54060 

ggacagctgc ttccttttat taccctagaa ctctcgtctt tgatcaggcc ccctccccta 54120

tgccacacag tccctgtcac tcgggtgagc ccagtagtca tggggaaggc ctgcgggttc 54180

caaacatcca aaggcttgcg tgcagcatga cagcttgaaa ccgatgtttt ttaccttgat 54240 

cagatttcag cttggcgggg gctttgctca gctttcagtg aggcctgggc cgatttccca 54300 

gcatcccctc ctgaggccag cctctgtttc ctgtgatttt ctgcacaaag tgggagggag 54360

gagtcttagg aaatgggggg ccacctcgaa acctaggcct cctctggctt ctctgtgcca 54420 

gtgcccccac gctttgtgtc tgtgtcccca gcccatggga ctgtgttatt ccctgagtgc 54480 

tgccgcatgc ccagcccgca ctgaggacgt ggagccccga ggggcaggat ggcctccatg 54540

gtcacacgta ggaagtggcc tccaccctcc gatgatcctc tccccccctc cctttcagcg 54600 

ccttccccgg gggtgtcatc agccctcctg cctgtgcttt gtcccgtctt ctgcaggcgc 54660 

atgggacgtg ctgacaggtc ctctgccggg ttcctgcctt gctatgcgca cgctggtcac 54720

cacagaggcc tggcccttct tctgtagcag tcccacaccc gcaacaggtg tggctgctga 54780 

ccacctgctt tctgcccctc tggtcctgag gagggcgcag tgggcactca ggcgtggctg 54840 

agcagatgtg tgttgccggg aggaggaagg actgctccag tcagggctga atttcccacc 54900 

cggagcattt ctgctgtatt tggtgtagcg cctgctgctt aaagctctga ttcccagttg 54960 

gcaccctttc ccttctgcat tgaaaaacat acggatgcat gtcttcttgc agtgaatgtg 55020 

tattctccca gcctctcttc tgggttgggg ctggaggtgg agcggcacac aggagccgca 55080 

gcgatggagg atgtgcgggt gcagcacccc gtacagcagg gatgccaaac ccgcgctgag 55140 

tccctctcaa cttctgcttt gaagcccagt cacgccattg cctgggtttt gctgggcggg 55200 

gctgcatgtg atgttctcct ctgtccctcc cccagagccg cccacctgct ccccggacca 55260 

gtttgcatgt gccacagggg agatcgactg tatccccggg gcctggcgct gtgacggctt 55320 

tcccgagtgc gatgaccaga gcgacgagga gggctgcccc gtgtgctccg ccgcccagtt 55380 

cccctgcgcg cggggtcagt gtgtggacct gcgcctgcgc tgcgacggcg aggcagactg 55440 

tcaggaccgc tcagacgagg tggactgtga cggtgaggcc ctccccgtca aggctctgcc 55500 

aagaccctgg ccctgccctc cgggatacga gcttggggct gcctccggcc tcacaggagt 55560

aggggctctg aaaacctttg cttgcaggga gattgccaag tctgtctttt aggcccaaca 55620

aggaaaactc tgcagttcca cccatcctgt cccaccaggt agtgtggctt gaaggcagac 55680 

tgtgagggtc tatctcacct tcctgcatta ggtcaggagt ttcacagaaa cctgaggcac 55740 

attcaggggt gggctgcaga ggtccatggc tcacaccctg gaaaatccgc ccccaaaaga 55800 

cagtgctgtc tccactgacc agtctgtggg atagtgctta agcctgagtg gtttctatca 55860

acatgtagaa tcaggaggta taaagagatt tgctcaggca tcctgggccc tctctgacca 55920

gcaggatctt cctttagatc ttgacagtga aacacatctc ttctgtgccc cctgtgagtt 55980 

ttctttcatt cattcattca ttcattcatt cattcattca ttcgagacag agtcttgctc 56040 

tgtcacccag gctggagtgc cctggtgtaa tctcggctca ctgcaacctc tgcctccagg 56100 

gttcaatcga ttctcctgcc tcagcctccc gagtagctgg gatgacaggt gcgcaccacc 56160 

atgcctggct aatttttgta tttttagtag agacagggtt tcaccatgtt ggccaggctg 56220 

gtctcgaact cctgacctca ggtgatccgc ccgcctcagc ctcccaaagt gctgggatta 56280 

caggcatgag ccaccgcgcc cggcctgagt tttcctttta tgaaggacct gcttggttgg 56340

ttgcctgcca catgttgtca gcaccatggg cccaggactg ctgaggagct gttgatgccc 56400 

tcgctctccc agagccaccg gctctgttag ataattcaca tgcagtctgg ccactgtcct 56460 

acgtcctcat tcacaaagag cagacatttc gtagaagatg agggcctggg agtaacctcc 56520

ctgcatgttt ttctataaag gcatagtggt taagtccttc cagctcattg accattggag 56580

aattttatgg aggctgtaga ctaggggctg gtaaactaag ggcccagggg ccaaatccag 56640

cctgccacct acttttgtaa ataaagtttt cttggtgcac agccatgccc attcattcat 56700 

ttgcacaatg tctgtggctg ctttcatgcc aaaagcagga gaactgagtg gttatgctgg 56760 

agacctacgg ccttcaaagc cccagacctc acgtctggcc cttgacagac agagcttccc 56820

cagccctgct gcgcatcctg gcccagcatg tgctgtgtgt gtgatttcag cttgcaggag 56880 

ccgtggttag gaattgtccc tgtgttggtc cattttgcat tgctatgaag gagcacctga 56940 

ggccgggtag attatgaagg aaagaggtct gtctggctca tggttctgta ggcagcacca 57000 

gtatggcacc cgcatctgct cagcttctag tgaggtctca ggaagctttg actcatggtg 57060 

gaagtcgaag cgggagcagg tgcatcacat ggtgagagag ggagcaacgg agagagagag 57120

agagagagag agagcgcctc tccctcttgc cctcaccttg agaggagatg ccaggctcct 57180 

ttaagtaacc agctcccatg tgaactcaca gtgagagccc atttgctact gcggagaggg 57240

caccaggcat ctgctcccat gacccaaaca ctgcccacca ggccctacct ccaaccttgg 57300 

ggtcatattt tattctgttc tatgctatgc tatgctatgc catgccatgc catgccatgc 57360 

tattcctatt ctattatttg agacagaatc tcgctctgtt gcccaggctg gagtgcagtg 57420 

gcatgatctt ggctcactgc aacctccacc tcccaggttc aagcgattct cctgcctcag 57480

cctcccgagt agctgggatt acaggcacac accaccacac ccgggtaatt tttgtatttt 57540 

caatagagat ggggtttcac catgttggcc aggctggtct caaactcctg gcctcaagtg 57600 

atccacttac ctcggcctcc caaagtgcca tgattacaga tgtgagtcac tgcgcccagt 57660 

gagggtcaca tttccgttga gatttggagg ggcagacgtt ggagccatct gagccccctc 57720

gtcccgctct agcttctcct cccgtgtgcc ccgcggtgct ggtggcaggc ccttacgccg 57780 

gttctggctg cacgctctgt tccagaagct ttcttccctg cttggttacc agaaaatcat 57840 

cccatccatt acaaggacag ggtcccctta tctcccattc ccagggcagg acaccggggg 57900 

cagggcaggt ggggaactga gcaagttctc tgggggcagg cgtggctatg gctccctctg 57960 

ggtgggcgtc tggggagggg tggaggcagc cgtcagcgcc ctggcttgct cttcctccct 58020 

ggccagagac tgtggccttg tgctgctccc gtgtgggctg cctgcacctc cagtgggttg 58080 

tgctccctcc cctcccctcc cctcaagctc tgctgagcac cactgccttc cacagccccc 58140 

actctcggga ggcgaggctc ctcgtggcca ttcctgtcct tggcacccac ccccccacca 58200 

acctggtaga gccttgggcg gggtctgtta ctccttgcat ggcgtagacc tccccacagt 58260 

aggcacctga cacatacctc ctggggggca ggcaggaggt gcgttgaggt ctcagccctg 58320

gcagtccctc ccctgcgtgg cataggcctc gccacagggt catcgagggt gggtggagac 58380 

tgtactagac cactccccgc tggtcctaga aagggtccca tctgtctgct ctctgtttgg 58440 

agtccagacc ttggttgctg tgccctgcat ggtgggctgg ggggcaccct ccagcctctc 58500 

tgagtgcatg gcctctcctt gcagccatct gcctgcccaa ccagttccgg tgtgcgagcg 58560 

gccagtgtgt cctcatcaaa cagcagtgcg actccttccc cgactgtatc gacggctccg 58620 

acgagctcat gtgtggtgag ccagcttctg gcacggggaa ggggcgtccg ggctgggttc 58680 

ccccaggaac gtggagttta ggggaggaga cgtgcctttc cagcggggct gggggctgtg 58740

tgggagactc aggcggctgg gaggctcctt gcgggaggca gggaagcctt tcccagggca 58800

gcggccagga ggacagactg tgagctgtgg gctcggcggc tacagagtct gcctcagtgg 58860

gcggggctga tggtgtccag gtgcctgcag cacgcaccca cccacgggac cttgctgagc 58920 

agcgtctgtc aggcagcaag attacccgag ggctgcagtg gtcctgttcc ctggcagctt 58980

actgtctggc tgaggaggag tgatgttcac atatgcacac atgtcatgtg cacacacatg 59040

tacatgacaa catcccacat gctcctcaaa tagcatgacc tgtacagtca cggatatagg 59100

gcctagggga taggaggcca agacagtcag ggaagacttt ccagaggcag tggctcctga 59160 

aaggctgtct gattcaggca ggaagggagc tgagttcaga taggaagtag caatgagtca 59220

ttgtgtctgg ggacatggcc actccttcgc tgcagaggga cctgggctga gagctcctct 59280 

cttatggctg cagtcgggag agaagtctgt tggggggaga agggggcttc ctcaagggac 59340 

tccctgtgcc ctttggcacc ttcgtgccag gtcaggcttg aggcctgaag gcagtggtgg 59400 

gggccaccaa gggtcgcctc ctctgctggg caagttccca gtctgacggg cctgtgccgt 59460 

gggccccagc tgtgggggcg ctgttgatgc gcagccaggc ctcgccgcca gagcccgcac 59520 

gcttccattc cgctgacttc atcgacgccc tcaggatcgc tgggccggcc ctgtgggaga 59580 

gtgaatgtgg cttttgccaa agttgagtct ggagcctgga aacttcccta tgggcagcct 59640 

tgatagtgga gtggcccaag gagcccaccc agccgaccct gcccctcccg tggctggtgg 59700 

gcggcaccag gggctgcctg gctttgctcg ttcaccaaca tcacccgggc tggccagggc 59760 

gcgctcactt ctgccaccac cgagggccct gggcgaagga gtgaatacca ggctgccttg 59820 

gcagggatgt gttgagggct gtggggagtc ggacagcggc gggggtcaga ggaggaggag 59880 

ggtgcaccgt gcaggctgaa gggccacgtt accctgaggt tggccaggct ccccaggcct 59940 

agcctcccag ctcccccact ttctccccac cctccaccag tggcaaagcc agccccttca 60000 

gggcgcacgg tgtctgcccc caaggagggc ccattccgtt ggggttaatg ttggccacct 60060 

ctttctgttt gtctctggca gaaatcacca agccgccctc agacgacagc ccggcccaca 60120 

gcagtgccat cgggcccgtc attggcatca tcctctctct cttcgtcatg ggtggtgtct 60180 

attttgtgtg ccagcgcgtg gtgtgccagc gctatgcggg ggccaacggg cccttcccgc 60240 

acgagtatgt cagcgggacc ccgcacgtgc ccctcaattt catagccccg ggcggttccc 60300 

agcatggccc cttcacaggt aaggagcctg agatatggaa tgatctggag gaggcaggag 60360 

agtagtctgg gcagctttgg ggagtggagc agggatgtgc taccccaggc cctcttgcac 60420 

atgtggcaga cattgctaat cgatcacagc attcagcctt tcccactgag cctgtgcttg 60480 

gcatcagaat ccttcaacac agaggcctgc atggctgtag caacccaccc tttggcactg 60540 

taggtgtgga gaaagctcct tggacttgac cttcatattc tagtaggaca tgtgctgtgt 60600 

tgtccacaaa tcctcatgta ccctagaaat gaatgtgggg gcggctgggc tctctccaga 60660 

gctgaaggaa tcactctgta ccatacagca gctttgtctt gagtgcagct gggatttgtg 60720

gctgagcagt tacaattcct acgtggccca ggcaccagga acgcaggctg tgtttgtaga 60780 

tggctgggca gccgcaccgc agagctgcac catgctggtt tgtatcacat gggtgaccat 60840 

ggtatgtcta agaaggtgga gtccctgtga ggtctgcagg tgcccccaca gctccaggcc 60900 

accttgagga ttgcctctgc ctgcccagcc ctgagttccc tctcccctgt cctgtcccac 60960 

tgtcacccca agccggcctc attgggagcc tgttggatgg cagggtatag atgtaacctg 61020 

attctctctg gggagcgggg ttatctggct tctcaagagc tcctaggagc ccacagtggt 61080 

ggcaccatca cagtcgcagc agcccccaga gaacgcggcc ctgtctgttc ctggcgtgct 61140 

ctgtgctgcc ccgcctgggt tccctgcccc agtcgcaggc cccttggagg aggtaccatg 61200 

tgtctcccgt ttcacagatg agccccgggg agctcactct agtagtggcc agagaggcct 61260 

gcggctcagg gagcggggca catttccaac aggacacacc gccctggtct gagtctcgtg 61320

ggtagtggga gcagaggaga gcgccctatg tctgtggggc ggcttggctg agcctggaag 61380 

ccacctgacc tcccccgtcc cttccctgcc aggcatcgca tgcggaaagt ccatgatgag 61440 

ctccgtgagc ctgatggggg gccggggcgg ggtgcccctc tacgaccgga accacgtcac 61500

aggggcctcg tccagcagct cgtccagcac gaaggccacg ctgtacccgc cggtgagggg 61560 

cggggccggg gaggggcggg gcgggatggg gctgtgggcc cctcccaccg tcagtgctgg 61620

ccaccggagg cttcccgggt tcctgggggc tgtgccaccg cctctgaggc atgcttgctt 61680 

tcttcccttt tcaaaccctt ctgcttcctt ctttaatgac attgttgatt gtggataatc 61740

tgaaaactac acaaaaatat aaagagccaa aatctcaccc aaatccacct cctagagtgg 61800 

ctgttgggct ccgtcagcat ccaggcggcc gtctgtgttc cgcacggccc agcccatcga 61860 

tagccgcctg caccaggcct gtctgccctc tgtgagcctc cccacagggt tccctccaca 61920

aacaccctgt tctcccaccc agggctggct gcttcctgga aaacagctgg atggttttgt 61980

gcatgacaga caaacacagg gtgattttcg tggctaaaat actccctgga gcttttggca 62040

gggtgagggg ctggctccag ctgagccacg ccttgagtga aatgactgtg aggagaataa 62100

actgccgctg ccctccagga tcactggggc tggctgggga gaacccccgt ttctgggagc 62160 

acagtcccag gatgccaagg cgagcttggt gccgagatgt gaactcctga gtgtaaacag 62220 

cgggggctga cttgacatgc tttgtatgct tttcatttgt tcctgcagct gtatgcccct 62280 

aaggtgagtc cagccccctt ctgcttcctc tggggcctcg ccagtgagcc ccaccttgct 62340 

ggggctggtt cctcctgccc ttctgggtat ccctcacatc tggggtcttg tcttcttgtt 62400 

ttatttttct tttttttttg agacggagtt tcacttttgt tgcccaggct tcagtgcaat 62460 

ggtgtgatct ctaggctcac cgcaacctct gcctcccagg ttcaagcagt tctcctgcct 62520 

cagcctccct agtagctggg attacaggca tgtgccacca cgcccagcta attttgtatt 62580 

tttagtagag atggggtttc tccatgttgg tcaggctgat cttgaactcc ctacctcagg 62640

tgatccgccc accttggcct cccaaagtgc tgggattaca ggcgtgagcc accgcacctg 62700 

gcctttttct tttcttttct tttctttttt ctgagacagg gtctcgctct gtcacccagg 62760 

ctggagtgca atggtgtcat catggctaac tgcagcctct accttctagg ctcaagcaat 62820

cctcccatct cagcccctaa gtagctagga ctgcacgcat gcatccccat gcccagctaa 62880

tatttacatt ttttgtagag atgaagtttc actatattgc ccaggctggt ctccaactcc 62940 

tggactcgag cgatcctcct gcctcggcct ccccaggtgc tgggattaca ggcgtgagcc 63000

accgtgcctg gcctggggta ttgtcttctt atggcacctg actgtggtgg gccctgggaa 63060

ggaagtagca gaagagggtt cttcttggtt tcctggacag taactgagtg ttctggaggc 63120 

cccagggcct ggctttgttt agggacaaag ggaactggta accagaagcc gagagtttaa 63180 

acacccactg cccttcttcc ctgctcctgc tgctgcaacc cagcttaacc agccaggagt 63240 

gctaggaacc caagcagggc ccccgagcac acagcaggca gctcacgaat tctcttttcc 63300 

tgttctccct tgggagctgg gaggatctta atcaggcaat aagagatggc actgagcagc 63360 

cagctaattt tttaaatcac tttattgttt aaccatatga ctcacccact taaaaaaggg 63420 

tacagttcag tgggttttag tgtattcaca gatgtgtgca accctcacca cagttaattt 63480 

tagaacattt tcctgcccct aaaagaaact ctgcatgaag ccagctgttt ttaaattagc 63540 

aaagttattt tgcatccttt aaatatatgt tcatggtaca aaattcaaaa gatacagaag 63600

agtctgcagt ccaaagagac tccgccccca tgacgccaag caggactccc tgggaggcat 63660

ggcctcctgc agtgtgtttc ttctatgtcc ccccaggggt catctgtaca tatgcaagca 63720

tacaagagcg tggactttgt tttccaagcc agaagataat tgtagattta tgtgcagttg 63780 

tgagaaagag cacagaccca tttatcctct gcctggtttc ccccagtgct gcctgccatc 63840

ttgcatgact tccattccta tcataagcaa gacactgata acgattcttt caccttattc 63900 

agattgacat aagtgttttt tgtttgttct tgagacaaac ttcctctgtc acccagtggg 63960

agtgcagtgg cacaatcaca gctcactgca gcctcaaact cctgggctca agcgattctc 64020

ctgcctcagt cccctcaagt agctcagatg gcaggtgtgc accatcatgc caggctaatt 64080 

tttaaatttt ttgtggaggt gaggcctcac taaatttcct gggctagtct tgaactcctg 64140 

agctaaagtg atcctcctgc ctcagcctcc caaagtggta ggattacagg catgagccac 64200 

tgcgcctggg ctgacatatg tgttttcgta agcccgaaag atagcatctg aagagtcaac 64260 

attgagcctt gccttttgct gctaatgatg tataaaagct gctgttctga gcatttcgga 64320

ggctcccagc tgccgtgtgc accctgccta gagctctacc gtaacccatc tccgggagga 64380

ggtgctattg ttttcctcat tttgcaacaa ggaggctgaa gaactgagca tgaaccactg 64440

gcctgggtcg ttcggttggt aggcagtggg gccaggccat ccaactcaca accaccttct 64500 

actctgcttc ccccgcaccc tgaagtttgt tctgttttga ggacacagcc gtcacattct 64560 

tggtggctga acagcactcc ttgtcaggtg tggctgggcc cccactggag ggcatcatgg 64620 

tcctctctcc tgctgcggtt gaaccttggc tgtttcaacc actcctgcca agtggccctc 64680 

tgaaagggac agtccatctt ttctcagcag agggccacac tggcaaaacg gtccctggca 64740

ccctttctct ccacctgtct aatatagagt aaaaatggta tcatgttaag atcttcattt 64800 

atatttattt tatcatgaat gatgtaagca tcattttgtg tgtttaagaa cctttgggcc 64860

cagcgtgatg gcttgcagct gtaatctcag cactttagga ggctgagatg agcggatcac 64920

ttgaggccgg gagtttgaga ccagcctggc caacatggag aaaccccgtc tctagtaaaa 64980 

atttaaaaat tagccgggta tggtgatccc agctacttgg gagtctgaag catgagaatt 65040

gcttgaacat gggaggcgga ggttgcagtg agccgagatc gcgccattgc actccagcct 65100 

gggcgacaga gcgagactct gtctcacaaa aaaaaaaaaa aaagaaaaga aaagaaatta 65160 

tcaatctcct cttttatggc atatatatat atatatatat atatatatat ttatttccct 65220

ttcttggtta tgttcataaa ggcctcccct gctctgatca taaaaaacaa cttattttca 65280

cactctctct cttttttttt tgagacagag ttttgctcct gttgcccagg ctggagtgca 65340

gtggcgcaat ctcagctcac tgtaacctcc gcctcccggg ttggagtgat tctcctgcct 65400 

taccttcccg agtagctggg attataggca tgcaccacca tgcctggcta attttgtact 65460 

tttagtagag acgggggttt ctccatgttg gtcaggctgg tctcgaactc gcgacctcag 65520 

gtgatccacc cacctcggcc tcccaaagtg ctgggattac agacgtgagc caccatgccc 65580 

agcccacact ctctttctta acgtcctcct cctttcgttt tacgttcaca tctttaattc 65640 

ttctgggatg taattagatt tgatgagcaa ggtgggcatc cagcttgttt cttggctgat 65700 

ggcttatggg tggcgtgaat tagtcggggt ctatcaggag gcagaaactc tatgagaatt 65760 

tgaacagaga aagttccgtc tacaggctta ttaccaggga ctggaatagc agaaattgaa 65820

cagtgagatg tacagagaac tctaagaatg caggaatagg ccaggcatgg tggctcacac 65880

ctgtcatccc agcactttgg gagaccaagg cgggtggatc acctgaggtc aggagttcga 65940 

gaccagcctg gccaacatag tgaaacccca tctctactaa aaatacaaaa aaattagctg 66000 

ggtgtggtgg cgcatgcctg taatcccagc ttctcgggag tctgaggctg gagaatcact 66060 

tgaacctggg aggcagaggt tgtagtgagc cgagatcatg ccattgtact ccagcctggg 66120

caacaagagc gagactcagt caaaacaaca acaacgcagg aatagcagat gagccgaggt 66180 

ggggcctccc cagcccccac cccccacccc gcaccctggg ccgagatcca gtcctctttg 66240

aatagggcct gggcgtggtt cacgggacat ctgagacatt gccgaggcgc tgcactggtg 66300

gatcttgcca gaagtctgcc cagtgcagat ttgggcagaa tctcaaactg ccttgggatg 66360 

taggagagaa accaggcctg gtcaagttca tgggaagagg tggaaacaga ccccataggc 66420

tggggcttgg gcagctgtag gaagccctct ctgctgcctc cctgcctgct ctctgctttg 66480 

aagcatcttc cccagtgccc ccagtctcat gccctctcaa cgttggggtc aaatcctgag 66540 

gaatacccag actggctctc tgggccaaag aggaccctct ccagaaagag cagggcccag 66600 

tgcggcttcc taaagggcag gggaagggcc tggccactcc ccagaggcta ctcaccagcc 66660 

atcaggatag ccccaggaag caggccttct cgagcccatt ttattacttt attttattat 66720

tttatttaat tttaaattta ttttttgaga cagagtctca ctctgttgcc caggctggag 66780 

tgcagtggtg cgatctcaac ccactgcagc ctctgcctcc agggttcaag ggattctccc 66840 

acctcagcct cccaagtagc tgggattaca ggtgcccgcc accacacccg gctaattttc 66900 

atatttttag tagagacgag gtttcaccat gttggccagg ctggtctcga actcctgacc 66960 

tcaagtgatc cgcccgcctc ggcctcccaa agtgctaggt caagcccatt ttaaagttga 67020 

agaaactgag gctgaggtaa attccctccc cagggatcct gctgcagcca gaaggtggta 67080

aaacaggact tcacccgggt ctgtctggcg tgaaaggcag tgttcttgta ccaccctagg 67140

gggcctgaga gaactgagtc cctcgggcat aactgacagt tctgttccca ttattccgca 67200

ggggctcgga tctggctgta tgctttccag gatggccttg gagacccaca taagccctac 67260

accctttggg aagctgcatg ttgggttggg gtgccgtcag tggcacttgt ggaaggtgca 67320

gacctgtgtg ggtgtgtggg cccagggccc ctggtccctt cctccctttg tagggctggt 67380

tgtgtgctgc ctggacctgg ggggcacgtt cacgtggtga atttgtctat ttactatccc 67440 

cgctttgggg ctggtgccag cacaggccct tgtgaagggg gtgcctttgt ctggagtggg 67500 

actgtggccc ctccctcagc gtggtgactt ctgtgtcagg gcttcagcag ggacgcagag 67560 

cccctgagtg ttcggaacaa gggcgtcatt gcaggagtta gactgtgtgt gatggaggga 67620

ggaggggcag gaggaaaggt cagaaggaga gttcctggga aggtccctga ggagcctggt 67680

gaggtgctaa ctggtgtgga ggacactcag ggcctgtggg gacatctcct actgctgggg 67740

gccagccaca aagggaactg gccgaagtcc tgtccccgcc ttcacagccc agcatctggt 67800 

cacaaggcag gtacttggaa gggcgcgggc acctgggcca aaagtgcctg ggttcccttt 67860 

gcctttcact gagatgacct tcggggcagg tggctgctgc ctcccctcct gtccccaggt 67920 

tttgccaact ggccagagga aggggtcctg ggaagcaggg gggccagaag ccctctctgc 67980 

aaggaaagcc cgaggggtgt gggaggaagg aaggaatgcc caggctggcg aggctctaag 68040 

tcaccctggc ttggctctcc tcagatcctg aacccgccgc cctccccggc cacggacccc 68100 

tccctgtaca acatggacat gttctactct tcaaacattc cggccactgc gagaccgtac 68160 

aggtaggaca tcccctgcag ccctccatgg ccattgggtt cccgccagcc cgtggtggag 68220 

gggcctaatc cccatgccac tgatgagggg aggtattctg ggtgctagtg ggcaggtgcc 68280

gggcccagcc ctgcctccct ctgctctgcc aaccacacta ggctgcctcc ccagacaagc 68340 

tcagcgggca ctgcatgttg ggttcagaaa tcagcagaac tccacgttct gagctgctct 68400

tcaagttgct cctatggggg ttacttttaa gctgggaaat ggctgtggcg tcgaggggcc 68460

gggggcttgg gctccaaact ctgactgtgt gtttgagtcc ggctgtggaa acctagccat 68520

tgagatgccc cctcttggtg gctctgtcct cttaggatgg gacaagtctg tgaaggctgc 68580

tgcagcaccc accgtagacc cctaatcgtg tgacgtcacc aggatggtcc gggctgctca 68640

cttgccacag tggcctgttt gagcccggga agccaacggg gctgctcagc tggacaccag 68700

ccccccgagc tgcccatgtt ggggtcacag gccccacctc cctggttggg gaggggcaac 68760

tgagagtgtg gagaggtggg acccaggtgt gctggtctcc gcaggggctg gatcagagcc 68820

tgggatgggc agggtgagcc tcctgacctt taacccagtg gtgtcaggca acgtggccca 68880

cccgccagcc gcaccaggcc ccacccccgc aggtgaaggg gtgggatagg ctgggcctgg 68940

gccaggacac ctctggacca cgcattcctc attgcttggg tccctggagc agcagggcct 69000

cccgagtgtg gtgccgcctg ccacctagtg gccatttcca cgaactccca ggcctggctg 69060

gggagccgga actgcagcct ccatttccac cccactccgg gtcgggccac ctccctgatg 69120

cctcagtatt atatcaaact gtcacagtct gtcccacagc cttacagacc actgtctcca 69180

gaatggtcac atccacactg ggcagcccag tctcgctagt tcctcgtccc acctcctgcc 69240

tttgctcatg cccgtcctgc tctgggccca ccgcggacac atcttccccc cgcccgccgt 69300

ctgacctcac agcagctggg ccccaagagg agtatcctgt cctgctgcac ttttctcaac 69360

acccggtgtt ggctgcacct tcccacccat tgcaggcccc tctgtgacag gacgggggct 69420

cctaaacaca ccacagttcc gagtctgaac tcacacagtg ggatgcggcg tttctgggcc 69480

acagttgggt gcaggtagcc tctgggagga tgggaggtca ggagccatct tgcgagtcag 69540

gttgcttgaa ctcaggatgg aagtgttccg ggcccattgg ttgctgtatt agcctgttct 69600

cacgctgcta ataaagacat acccaagact gggtaattgt aaaggaaaga ggtttaacgg 69660

actcacagtt ccacctgcct ggggtggcct cacaatcatg gtagaagaca aggaggagca 69720

agtcacatct tacatggctt cagggaacag acagcatgag aaccaagcga aaggggtttc 69780

cccttgtaaa accatcaagt ctagtgagat ttattcacta ccacgagaac agtatggggg 69840

gaaccacccc catgattcaa tcatctccca ctgggtccct cccacagcac gtgggaatta 69900

tgggagtaca attcaagatg agatttgggt ggggacacag ccaaacccta tcggttgcca 69960

acatttacag taacagtgtt aggtgaacag ttgtccagtc tcctgttttg tcggacactg 70020

tttctagcac cttccaggca gaatctcatg tatccttcac tttcgaaatg ggtactattt 70080

catccccact tttatcaatg agaaactaaa gctcgaagag gtcaagtaag ttcctggcca 70140

aggtcagcta gcaggctcta gaggcctcgt tctccttaga ggcagccttg ccagggccca 70200

ggcttggcag gctgcagggc aggtgcgggc atgcccatgg tagaggtggg accattgagg 70260

ctcagagagg gtaagtgatg agccctggcg acacagcggg gtgggtccag agtccggcct 70320

gcatcttctg gagctggcca gtggacaggc ctttcccgtt cacagccccg gggctgctgt 70380

gcccaccagg gcggatgtgc ctaccgaatc ccactcctct gtgtgtgtcc ctttcaggcc 70440

ctacatcatt cgaggaatgg cgcccccgac gacgccctgc agcaccgacg tgtgtgacag 70500

cgactacagc gccagccgct ggaaggccag caagtactac ctggatttga actcggactc 70560

agacccctat ccacccccac ccacgcccca cagccagtac ctgtcggcgg aggacagctg 70620

cccgccctcg cccgccaccg agaggagcta cttccatctc ttcccgcccc ctccgtcccc 70680

ctgcacggac tcatcctgac ctcggccggg ccactctggc ttctctgtgc ccctgtaaat 70740

agttttaaat atgaacaaag aaaaaaatat attttatgat ttaaaaaata aatataattg 70800

ggattttaaa aacatgagaa atgtgaactg tgatggggtg ggcagggctg ggagaacttt 70860

gtacagtgga gaaatattta taaacttaat tttgtaaaac agaactgcca ttcttttgtg 70920

ccctgtgtgc atttgagttg tgtgtccccg tggagggaat gccgaccccc ggaccaccat 70980

gagagtcctc ctgcacccgg gcgtccctct gtccggctcc tgcagggaag ggctggggcc 71040

ttgggcagag gtggatatct cccctgggat gcatccctga gctgcaggcc gggccggctt 71100

tatgtgcgtg tggcctgtgc cgtcagaaag ggccctgggc ttcatcacgc tgttgctgtt 71160

cgtcttcctc agattcttag tctttttttt tttttttttt ttttgagacg gagtctttct 71220

ctgtcatcca ggctggagtg cagtggtaca atctcagctc actgcaagct ccgactccca 71280

ggttcaagtg agtctcctgc ctcagcctcc cgagtagctg ggactacagg tgcgcgccac 71340

cacacccgcc cagctaattt ttgtattttt agtagagatg gggtttcacc atgttggcca 71400

ggatgatctc gatctcttga cctcgtgatc cgcccacctc ggcctcccaa agtgctggga 71460

ttataggcat gagccactgt acccagctga ctcttagtca cttttaagaa ggggactgtg 71520

ccttcatttt tcactgggcc ctgcagaata tatgcctggg ctctgggctc ttctgaacct 71580

gtgttggctt ccatctgacc tctctgtgcc agcccaaggc tgctgctctt cctgagggca 71640

aggagcccca tgactgcgtg ttgactcgct ggatggggct gctgagccca ctctgccaca 71700

ccacgtgccc ctggcaggga gggaatccct gggtcctcac aggaacagtc agcaagccac 71760

acctgacgcc tgctgtgggc ccatccctgc ggtgctggag aagacagaca aggcctggtc 71820

actgcctctg cagggtcccc agtccgtgga aggagacagt aatctaggca ttttcggtgg 71880 

ggaagctgag ctgttctcgt gtcctgaagg ccaggcggga acagccgtct tcagagggaa 71940

gggagaaaat gcacatcgca tcagtggaga agggcctgac ttccctcagc atggtggagg 72000 

gaggtcagaa aacagtcaag cttgagtatt ctatagtgtc acctaaata 72049

<210> 10 
<211> 8705 
<212> DNA 

<213> Homo sapiens 
<400> 10 

ggactcaggg gcagcaggga ggtacaccca tggttagtgg gcggaccata gggggtaatg 60 

agagggtgaa tcgatggaac ctgggggaca caatcgaagt ggttccagag tcgggctgta 120 

ctaattaaag agacggggca gtggacaggc attttcagtt gactgcccag ggagtgttct 180 

gcccaacagg gaggatatgc gtacagaatc atactcgatc agcatgagtc caattcagac 240

cgtacatcag tggagatatg ggtcccccga tgactccgtg gaacactgat gtttgtgaca 300 

ggggagtaca gcaccagcca tcagcaggcc agtaaatcat accggcctgc gaaattggac 360

tcagacccgg atccaccctg accgacgtcc caagccccca ccccccaccc cccaccatgg 420 

gccgagatcc agtcctcttt gaatagggcc tggccgtggt tcacgggaca tctgagacat 480 

tgccgaggcg ctgcattggt ggatcttgcc agaagtttgc ccagtgcaga tttgggcaga 540 

atctcaaact gccttgggat gtaggagaga aaccaggcct ggtcaagttc atgggaagag 600 

gtggaaacag accccatagg ctggggcttg ggcagctgta ggaagccctc tctgctgcct 660 

ccctgcctgc tctctgcttt gaagcatctt ccccagtgcc cccagtctca tgccctctca 720 

acgttggggt caaatcctga ggaataccca gactggctct ctgggccaaa gaggaccctc 780 

tccagaaaga gcagggccca gtgcggcttc ctaaagggca ggggaagggc ctggccactc 840 

cccagaggct actcaccagc catcaggata gccccaggaa gcaggccttc tcgagcccat 900 

tttattactt tattttatta ttttatttaa ttttaaattt attttttgag acagagtctc 960 

actctgttgc ccaggctgga gtgcagtggt gcgatctcaa cccactgcag cctctgcctc 1020 

cagggttcaa gggattctcc cacctcagcc tcccaagtag ctgggattac aggtgcccgc 1080 

caccacaccc ggctaatttt catattttta gtagagatga ggtttcacca tgttggccag 1140 

gctggtctcg aactcctgac ctcaagtgat ccgcccgcct cggcctccca aagtgctagg 1200 

tcaagcccat tttaaagttg aagaaactga ggctgaggta aattccctcc ccagggatcc 1260

tgctgcagcc agaaggtggt aaaacaggac ttcacccggg tctgtctggc gtgaaaggca 1320 

gtgttcttgt accaccctag ggggcctgag agaactgagt ccctcgggca taactgacag 1380 

ttctgttccc attattccgc aggggctcgg atctggctgt atgctttcca ggatggcctt 1440 

ggagacccac ataagcccta caccctttgg gaagctgcat gttgggttgg ggtgccgtca 1500 

gtggcacttg tggaaggtgc agacctgtgt gggtgtgtgg gcccagggcc cctggtccct 1560 

tcctcccttt gtagggctgg ttgtgtgctg cctggacctg gggggcacgt tcacgtggtg 1620 

aatttgtcta tttactatcc ccgctttggg gctggtgcca gcacaggccc ttgtgaaggg 1680 

ggtgcctttg tctggagtgg gactgtggcc cctccctcag cgtggtgact tctgtgtcag 1740 

ggcttcagca gggacgcaga gcccctgagt gttcggaaca agggcgtcat tgcaggagtt 1800 

agactgtgtg tgatggaggg aggaggggca ggaggaaagg tcagaaggag agttcctggg 1860 

aaggtccctg aggagcctgg tgaggtgcta actggtgtgg aggacactca gggcctgtgg 1920 

ggacatctcc tactgctggg ggccagccac aaagggaact ggccgaagtc ctgtccccgc 1980 

cttcacagcc cagcatctgg tcacaaggca ggtacttgga agggcgcggg cacctgggcc 2040 

aaaagtgcct gggttccctt tgcctttcac tgagatgacc ttcggggcag gtggctgctg 2100 

cctcccctcc tgtccccagg ttttgccaac tggccagagg aaggggtcct gggaagcagg 2160 

ggggccagaa gccctctctg caaggaaagc ccgaggggtg tgggaggaag gaaggaatgc 2220 

ccaggctggc gaggctctaa gtcaccctgg cttggctctc ctcagatcct gaacccgccg 2280 

ccctccccgg ccacggaccc ctccctgtac aacatggaca tgttctactc ttcaaacatt 2340 

ccggccactg cgagaccgta caggtaggac atcccctgca gccctccatg gccattgggt 2400 

tcccgccagc ccgtggtgga ggggcctaat ccccatgcca ctgatgaggg gaggtattct 2460 

gggtgctaat gggcaggtgc cgggcccagc cctgcctccc tctgctctgc caaccacact 2520

aggctgcctc cccagacaag ctcagcgggc actgcatgtt gggttcagaa atcagcagaa 2580

ctccacgttc tgagctgctc ttcaagttgc tcctatgggg gttactttta agctgggaaa 2640

tggctgtggc gtcgaggggc cgggggcttg ggctccagag tctgactgtg tgtttgagtc 2700 

cggctgtgga aacctagcca ttgagatgcc ccctcttggt ggctctgtcc tcttaggatg 2760 

ggacaagtct gtgaaggctg ctgcagcacc caccgtagac ccctaatcgt gtgacgtcac 2820

caggatggtc cgggctgctc acttgccaca gtggcctgtt tgagcccggg aagccaacgg 2880

ggctgctcag ctggacacca gccccccgag ctgcccatgt tggggtcaca ggccccacct 2940

ccctggttgg ggaggggcaa ctgagagtgt ggagaggtgg gacccaggtg tgctggtctc 3000

cgcaggggct ggatcagagc ctgggatggg cagggtgagc ctcctgacct ttaacccagt 3060

ggtgtcaggc aacgtggccc acccgccagc cgcaccaggc cccacccccg caggtgaagg 3120

ggtgggatag gctgggcctg ggccaggaca cctctggacc acgcattcct cattgcttgg 3180 

gtccctggag cagcagggcc tcccgagtgt ggtgccgcct gccacctagt ggccatttcc 3240 

acgaactccc aggcctggct ggggagccgg aactgcagcc tccatttcca ccccactccg 3300 

ggtcgggcca cctccctgat gcctcagtat tatatcaaac tgtcacagtc tgtcccacag 3360 

ccttacagac cactgtctcc agaatggtca catccacact gggcagccca gtctcgctag 3420

ttcctcgtcc cacctcctgc ctttgctcat gcccgtcctg ctctgggccc accgcggaca 3480 

catcttcccc ccgcccgccg tctgacctca cagcagctgg gccccaagag gagtatcctg 3540 

tcctgctgca cttttctcaa cacccggtgt tggctgcacc ttcccaccca ttgcaggccc 3600 

ctctgtgaca ggacgggggc tcctaaacac accacagttc cgagtctgaa ctcacacagt 3660

gggatgcggc gtttctgggc cacagttggg tgcaggtagc ctctgggagg atgggaggtc 3720

aggagccatc ttgcgagtca ggttgcttga actcaggatg gaagtgttcc gggcccattg 3780 

gttgctgtat tagcctgttc tcacgctgct aataaagaca tacccaagac tgggtaattg 3840

taaaggaaag aggtttaacg gactcacagt tccacctgcc tggggtggcc tcacaatcat 3900

ggtagaagac aaggaggagc aagtcacatc ttacatggct tcagggaaca gacagcatga 3960

gaaccaagcg aaaggggttt ccccttgtaa aaccatcaag tctagtgaga tttattcact 4020

accacgagaa cagtatgggg ggaaccaccc ccatgattca atcatctccc actgggtccc 4080

tcccacagca cgtgggaatt atgggagtac aattcaagat gagatttggg tggggacaca 4140 

gccaaaccct atcggttgcc aacatttaca gtaacagtgt taggtgaaca gttgtccagt 4200 

ctcctgtttt gtcggacact gtttctagca ccttccaggc agaatctcat gtatccttca 4260 

ctttcgaaat gggtactatt tcatccccac ttttatcaat gagaaactaa agctcgaaga 4320

ggtcaagtaa gttcctggcc aaggtcagct agcaggctct agaggcctcg ttctccttag 4380

aggcagcctt gccagggccc aggcttggca ggctgcaggg caggtgcggg catgcccatg 4440

gtagaggtgg gaccattgag gctcagagag ggtaagtgat gagccctggc gacacagcgg 4500 

ggtgggtcca gagtccggcc tgcatcttct ggagctggcc agtggacagg cctttcccgt 4560 

tcacagcccc ggggctgctg tgcccaccag ggcggatgtg cctaccgaat cccactcctc 4620 

tgtgtgtgtc cctttcaggc cctacatcat tcgaggaatg gcgcccccga cgacgccctg 4680 

cagcaccgac gtgtgtgaca gcgactacag cgccagccgc tggaaggcca gcaagtacta 4740 

cctggatttg aactcggact cagaccccta tccaccccca cccacgcccc acagccagta 4800 

cctgtcggcg gaggacagct gcccgccctc gcccgccacc gagaggagct acttccatct 4860 

cttcccgccc cctccgtccc cctgcacgga ctcatcctga cctcggccgg gccactctgg 4920 

cttctctgtg cccctgtaaa tagttttaaa tatgaacaaa gaaaaaaata tattttatga 4980 

tttaaaaaat aaatataatt gggattttaa aaacatgaga aatgtgaact gtgatggggt 5040 

gggcagggct gggagaactt tgtacagtgg agaaatattt ataaacttaa ttttgtaaaa 5100 

cagaactgcc attctttcgt gccctgtgtg catttgagtt gtgtgtcccc gtggagggaa 5160 

tgccgacccc cggaccacca tgagagtcct cctgcacccg ggcgtccctc tgtccggctc 5220

ctgcagggaa gggctggggc cttgggcaga ggtggatatc tcccctggga tgcatccctg 5280 

agctgcaggc cgggccggct ttatgtgcgt gtggcctgtg ccgtcagaaa gggccctggg 5340 

cttcatcacg ctgttgctgt tcgtcttcct cagattctta gtcttttttt tttttttttt 5400 

ttttttgaga cggagtcttt ctctgtcatc caggctggag tgcagtggta caatctcagc 5460 

tcactgcaag ctccgactcc caggttcaag tgagtctcct gcctcagcct cccgagtagc 5520

tgggactaca ggtgcgcgcc accacacccg cccagctaat ttttgtattt ttagtagaga 5580 

tggggtttca ccatgttggc caggatgatc tcgatctctt gacctcgtga tccgcccacc 5640 

tcggcctccc aaagtgctgg gattataggc atgagccact gtacccagct gactcttagt 5700

cacttttaag aaggggactg tgccttcatt tttcactggg ccctgcagaa tatatgcctg 5760

ggctctgggc tcttctgaac ctgtgttggc ttccatctga cctctctgtg ccagcccaag 5820

gctgctgctc ttcctgaggg caaggagccc catgactgcg tgttgactcg ctggatgggg 5880

ctgctgagcc cactctgcca caccacgtgc ccctggcagg gagggaatcc ctgggtcctc 5940

acaggaacag tcagcaagcc acacctgacg cctgctgtgg gcccatccct gcggtgctgg 6000

agaagacaga caaggcctgg tcactgcctc tgcagggtcc ccagtccgtg gaaggagaca 6060 

gtaatctagg cattttcggt ggggaagctg agctgttctc gtgtcctgaa ggccaggcgg 6120 

gaacagccgt cttcagaggg aagggagaaa atgcacatcg catcagtgga gaagggcctg 6180 

acttccctca gcatggtgga gggaggtcag aaaacagtca agcttgttgc tgggtgacag 6240 

tgcatttaat aatcaaaata taggctgggt acggtggctc atgcctgtaa tcccagcact 6300

ttgggaggct gaggcaggtg gatcacttga ggccaggagt ttgagaccgg cctggccaac 6360

atggcaaaac ctcaactact aaaatacaaa aactagccgg gcgtggtggt gcacgcctgt 6420 

aatcccagct acttgggagg ctgaggcagg agaattgctt gaacctggga ggcggaggct 64 80 

gcagtgagcc gagattgtgc cactgcactc cagcctgggc aacagagcaa gactctgtct 6540 

caaaaaaaaa aaaaaaaaaa gcaatacaaa atacaaatat cactttcact aaaagaaggg 6600 

atggaagacc caaaacaaac agaaaacaac aaaatggcag gagtaagtcc ccacttatca 6660 

ataataacat tgactgtaaa taggctaagc tctgcaatca aaagagtggg ccaggagcgg 6720 

tggctcacgc ctgtaattcc aacgctttgg gaggctgagg cggatggatc atttgatgtc 6780 

acgagtttta agaccagcct ggccaacaag gtgaaacccc atctgtacta aaaatacaaa 6840 

aattagccag gcggtagtgg cacgcacctg taatcccagc tacttgtgag gctgaggcag 6900 

gagaatcact ggaggctggg aagcggaggt tgctgtgagc caagatggag ccactgcact 6960 

cccacctggg cgacagagtg agatcctgtc ttaagaaaaa aaagagtgga tgaatggatc 7020

aaaaaacaag acccaaccat ctcttgcata caagaaacac actttaccta taaaaacaca 7080 

ctaggccagg tgtggtggct cacacctgta atcccagccc tttgggaggc ctgactggca 7140

gatcacctga ggccaggagt ttcagaccag cttgaccgac atggcaaaac cccatctctc 7200

ctaaaaatac aaaaaaacaa aaaaaagaaa aaggctggaa gtagtgatgt gtgcctgtag 7260

ccccagctac ttgggaggct gaggcaggag aattgcttga atccgggaag tggaggttgc 7320

agtgagccag gatggtgcca ctgcactcca gcctgggtga cagagcgaga ccctgtcata 7380

aaaaaaaaaa gaaaagaaaa gaaaaacgag aaaaacaaac acaaaattag tagaagaaaa 7440

gaaataataa agatcagaac aggccaggct catgggcaca gtggctcaac tcctacctgc 7500

tcaggagttt gagaccagtc tggccaacat ggcaaaaccc catctctcct aaaaatatga 7560 

aaaaaaaaaa ataggctgga tgtggtgatg tgtgtgtgcc tgtagcccca gctacttggg 7620 

aggctgaggt gggagaatca cttgagccca ggaagtggag gctgcagcga gtcatgaatg 7680 

caccctgcac tctagctggg taactggagt gagattctgt ctcaaaaaag caaagaccag 7740

agcagaaata aatgaaatgg aaatgaagga aacaatgcaa aatgatacaa aaagtttttt 7800 

cgaaaagata aacaaaatca acaaaccttt agccagatta agaaaaaaag agagaagacc 7860

caaataaata aaatccgaga ttaaaaagga gacattacca ctgataccac agaaattcaa 7920

aggatcatta gaggcaacta tgtgcaacta tatgctaatg aactggaaaa cctagaagaa 7980 

ctgggtaaat ttctagacac atacaaccta tcaagattga accatgaaga aatccaaaac 8040

ctgaacaggc cgggcacggt ggcttacgcc tgtaatccca gcactttgga aggcctgaga 8100 

tcaggagttc gagaccagcc tggccaacat ggtgaaaccc catctctact gaaaaaatat 8160 

aaaaattagc cgggcgtggt ggcgggtgcc tctaatgtca gccactcggg aggctgaggc 8220 

aggaaaatca cttgaacctg ggaggcatag gttgcagcga gccgaggttg caccactgca 8280 

ctccagcctt ggcgacagag ccagactcca tctcaaaaaa attaaaataa caaaaacctg 8340 

aacagaccaa taacaagtaa tgcgatgaaa actgtaataa aatgtttccc aacaaagaaa 8400

gcccaggaac aaatggcttc actgctgaat tttaccaaac attttttttt ttttgagacg 8460 

gagtctcgct ctgtcgccca ggctggagtg cagtggtgta acctcggttc gctggtaact 8520 

tatgcctctc aggctgcaag tgattttcct gcttcaggcc ccccgagtgg ctggaaatta 8580 

gatggtactt gtcaaacaag gcctggctaa atttctatat ttccttcaag tagaagatgt 8640 

gcttccaaca aaggttgggt tacggctggc ttctgaaaat cttggatttc aaggctcccc 8700 

aaaag 8705

<210> 11
<211> 66933
<212> DNA
<213> Homo sapiens
<400> 11 

tataatcaag cgcgttccgt ccagtccggt gggaagattt tcgatatgct tcgtgatctg 60 

ctcaagaacg ttgatcttaa agggttcgag cctgatgtac gtattttgct taccaaatac 120 

agcaatagta atggctctca gtccccgtgg atggaggagc aaattcggga tgcctgggga 180 

agcatggttc taaaaaatgt tgtacgtgaa acggatgaag ttggtaaagg tcagatccgg 240 

atgagaactg tttttgaaca ggccattgat caacgctctt caactggtgc ctggagaaat 300 

gctctttcta tttgggaacc tgtctgcaat gaaattttcg atcgtctgat taaaccacgc 360 

tgggagatta gataatgaag cgtgcgcctg ttattccaaa acatacgctc aatactcaac 420 

cggttgaaga tacttcgtta tcgacaccag ctgccccgat ggtggattcg ttaattgcgc 480 

gcgtaggagt aatggctcgc ggtaatgcca ttactttgcc tgtatgtggt cgggatgtga 540 

agtttactct tgaagtgctc cggggtgata gtgttgagaa gacctctcgg gtatggtcag 600 

gtaatgaacg tgaccaggag ctgcttactg aggacgcact ggatgatctc atcccttctt 660 

ttctactgac tggtcaacag acaccggcgt tcggtcgaag agtatctggt gtcatagaaa 720

ttgccgatgg gagtcgccgt cgtaaagctg ctgcacttac cgaaagtgat tatcgtgttc 780 

tggttggcga gctggatgat gagcagatgg ctgcattatc cagattgggt aacgattatc 840 

gcccaacaag tgcttatgaa cgtggtcagc gttatgcaag ccgattgcag aatgaatttg 900 

ctggaaatat ttctgcgctg gctgatgcgg aaaatatttc acgtaagatt attacccgct 960 

gtatcaacac cgccaaattg cctaaatcag ttgttgctct tttttctcac cccggtgaac 1020

tatctgcccg gtcaggtgat gcacttcaaa aagcctttac agataaagag gaattactta 1080 

agcagcaggc atctaacctt catgagcaga aaaaagctgg ggtgatattt gaagctgaag 1140 

aagttatcac tcttttaact tctgtgctta aaacgtcatc tgcatcaaga actagtttaa 1200 

gctcacgaca tcagtttgct cctggagcga cagtattgta taagggcgat aaaatggtgc 1260

ttaacctgga caggtctcgt gttccaactg agtgtataga gaaaattgag gccattctta 1320 

aggaacttga aaagccagca ccctgatgcg accacgtttt agtctacgtt tatctgtctt 1380

tacttaatgt cctttgttac aggccagaaa gcataactgg cctgaatatt ctctctgggc 1440 

ccactgttcc acttgtatcg tcggtctgat aatcagactg ggaccacggt cccactcgta 1500

tcgtcggtct gattattagt ctgggaccac ggtcccactc gtatcgtcgg tctgattatt 1560 

agtctgggac cacggtccca ctcgtatcgt cggtctgata atcagactgg gaccacggtc 1620

ccactcgtat cgtcggtctg attattagtc tgggaccatg gtcccactcg tatcgtcggt 1680

ctgattatta gtctgggacc acggtcccac tcgtatcgtc ggtctgatta ttagtctgga 1740 

accacggtcc cactcgtatc gtcggtctga ttattagtct gggaccacgg tcccactcgt 1800 

atcgtcggtc tgattattag tctgggacca cgatcccact cgtgttgtcg gtctgattat 1860 

cggtctggga ccacggtccc acttgtattg tcgatcagac tatcagcgtg agactacgat 1920

tccatcaatg cctgtcaagg gcaagtattg acatgtcgtc gtaacctgta gaacggagta 1980 

acctcggtgt gcggttgtat gcctgctgtg gattgctgct gtgtcctgct tatccacaac 2040 

attttgcgca cggttatgtg gacaaaatac ctggttaccc aggccgtgcc ggcacgttaa 2100 

ccgggctgca tccgatgcaa gtgtgtcgct gtcgacgagc tcgcgagctc ggacatgagg 2160 

ttgccccgta ttcagtgtcg ctgatttgta ttgtctgaag ttgtttttac gttaagttga 2220

tgcagatcaa ttaatacgat acctgcgtca taattgatta tttgacgtgg tttgatggcc 2280 

tccacgcacg ttgtgatatg tagatgataa tcattatcac tttacgggtc ctttccggtg 2340 

atccgacagg ttacggggcg gcgacctcgc gggttttcgc tatttatgaa aattttccgg 2400 

tttaaggcgt ttccgttctt cttcgtcata acttaatgtt tttatttaaa ataccctctg 2460 

aaaagaaagg aaacgacagg tgctgaaagc gagctttttg gcctctgtcg tttcctttct 2520 

ctgtttttgt ccgtggaatg aacaatggaa gtccgagctc atcgctaata acttcgtata 2580 

gcatacatta tacgaagtta tattcgatgc ggccgcaagg ggttcgcgtc agcgggtgtt 2640 

ggcgggtgtc ggggctggct taactatgcg gcatcagagc agattgtact gagagtgcac 2700

catatgcggt gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgccat 2760 

tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta 2820

cgccagctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt 2880 

tcccagtcac gacgttgtaa aacgacggcc agtgaattgt aatacgactc actatagggc 2940

gaattcgagc tcggtacccg gggatcctct agagtcgacc tgcaggcatg caagcttctc 3000

ttgtgccggt tgtacgctgt caggtcacac tggtgagtta ggcagggcac agatgcccag 3060

agcagaggga actttccttg gggattcaac acgtgcaagt cttaggggct ggcaaatcct 3120

gccctcagct agagaggggg cttttatttg agaccagaat cacctgagca tcctcctgtc 3180 

cccagctgtg tccagcctgt ctgcagggac atcctgagag gaccaggctc tcccctcatc 3240

cacctgccta agtgccactc tgaaccctgt ccacctgtgc cgtggagggg cgtgacctca 3300

agctgctcag ccagcagcag gcttggccct ggggggcagc agagacccag gtggctgtgg 3360 

ggtgggtgct tcgtggcgtg gttctgaaac ttcgttggaa gtgtgtggac agtgccttgc 3420

ctgttctctg tgggacccta tttagaaacg aggtctgagt tactgggggt catcactgtg 3480

ttctgatggc ccagctgtgt ggaggccgcg gtgcagcccc atccaaggag ccagggccct 3540

gggtctagcc gtgaccagaa tgcatgcccc ggaggtgttt ctcatctcgc acctgtgttg 3600 

cctggtgtgt caagtggtcg tgaaactctg tgttagctct tggtgttcct gaaagtgccc 3660 

ccgggtctca ggcctcagaa ccagggtttc ccttcatctc ggtggcctgg gagcatctgg 3720 

gcagttgagc aaagagggcg attcacttga aggatgtgtc tggccctgcc taggagcccc 3780

ccggcacggt gctggggcct gaagctgccc tcgggtggtg gagaggaggg agcgatgaag 3840 

tggcgtcgag ctgggcagga agggtgagcc cctgcaaggt gggcatgctg gggacgctga 3900 

gcagcatggc cagcagctgg gtctgcagcc tggtacccgg cgggacttgt ggttggggct 3960

ggtttgtggc caggagaggg gctggcagga gacaaggggg actgtgaggc agctcccacc 4020

cagcagctga agcccaatgg cctggctgtg tggctctcag ctgcgtgcat aacctctcag 4080 

tgcttcagtt ctctcatttg taaaatgagg aaacaaacag tgccagcctc ccagaggtgt 4140

catgaggatg aacgagtgac catgtagcat gggctgggtg cgtgtcacct aacatcacca 4200 

gcctttgcaa ggagagccct gggggcctgg ctgagtattt cccttgcccg gcccacccca 4260 

ggcctagact tgtgcctgct gcaggccctt gacccctgac cccattgcac ctgtctccac 4320

aggagccgag gaggtgctgc tgctggcccg gcggacggac ctacggagga tctcgctgga 4380

cacgccggac ttcaccgaca tcgtgctgca ggtggacgac atccggcacg ccattgccat 4440

cgactacgac ccgctagagg gctatgtcta ctggacagat gacgaggtgc gggccatccg 4500 

cagggcgtac ctggacgggt ctggggcgca gacgctggtc aacaccgaga tcaacgaccc 4560 

cgatggcatc gcggtcgact gggtggcccg aaacctctac tggaccgaca cgggcacgga 4620

ccgcatcgag gtgacgcgcc tcaacggcac ctcccgcaag atcctggtgt cggaggacct 4680 

ggacgagccc cgagccatcg cactgcaccc cgtgatgggg taagacgggc gggggctggg 4740 

gcctggagcc agggccaggc caagcacagg cgagagggag attgacctgg acctgtcatt 4800 

ctgggacact gtcttgcatc agaacccgga ggagggcttg ttaaaacacc ggcagctggg 4860

ccccaccccc agagcggtga ttcaggagct ccagggcggg gctgaagact tgggtttcta 4920

acaagcaccc cagtggtccg gtgctgctgc tgggtccatg cgtagaaagc cctggagacc 4980 

tggagggagc cctttgttcc cctggcttca gtttcctcat ctgtagaatg gaacggtcca 5040

tctgggtgat ttccaggatg acagtagtga cagtaagggc agcctctgtg acactgacca 5100 

cagtacaggc caggcctctt tttttctttt tttttttttg agatggagtc tcactctgtc 5160 

gcccaggctg gagtgcagtg gtgtgatctc agctcactac aacctctgcc tcctgggctc 5220

aagtgattct cctgcctcag cctcctgagt agctgggatt acaggtgcct gccactgtgc 5280 

ttggctaatg tttgtatttt tggtagagat ggggtttcac cgtcttggcc aggctggtcg 5340 

caaactcctg acctcaggtg atccacctgc ctcagcctcc caaagtgctg ggattacagg 5400 

catgagccac cacgcccggt caggccaggc ctcttttgaa cactttgcac accatgggtc 5460 

ttttcatcca ggggggtagg tacagttgta cagttgagga cactgaagcc cagagaggct 5520

cagggacttg cccagggtca cacagcagga tgtggcaggt gtggggctgg gcctggcagc 5580 

gtggctccag ctttccagca tagaaatctg tgaaagcaga tagtttgtcg gtcggtaggg 5640 

gagactttct gagacccgcc ccagcggctc agagggtagt agccaggggc cttcctgggg 5700 

gctcataacc cagaacactg aatgggaaaa ccctgatgga ggaggcgcag tggagctgtg 5760 

ggtgccgatg ggaagtccca gaggagctgg gaggtcagta gcggtgctgc cctctgtgga 5820 

gcacttagtg ggcaccaggt gtgtttccag gttcatggcc ctgggacctg aagctcagaa 5880 

ggtgaagtaa cttgcccagg gcacccgtcg ggcagcggcg ggcagaggat ttgtgggctg 5940

tggagcctgt gctcgtggcc cagccctggg ggttgtgagt gtgctggccg gggagctttt 6000 

cctgcaagtg gactggtgtc taggagccag catgtcaggc agcaggcagc gggagtgcag 6060 

caggcagcgg gagcacagca ggcagagggc ggggctcgag cagccatccg tggaccctgg 6120 

ggcacggagg catgtgggag agggctgctc catggcagtg gctgaagggc tgggttgtgc 6180 

cccgaggagg gtggatgagg gtaagaagtg gggtccccag gggctttagc aagaggaggc 6240 

ccaggaactg gttgccagct acagtgaagg gaacacggcc ctgaggtcag gagcttggtc 6300

aagtcactgt ctacatgggc ctcggtgtcc tcatctgtga aaaaggaagg gatggggaag 6360

ctgactccaa ggcccctcct agccctggtt tcatgagtct gaggatccca gggacatggg 6420 

cttggcagtc tgacctgtga ggtcgtgggg tccagggagg ggcaccgagc tggaagcggg 6480

aggcagaggg gctggccggc tgggtcagac acagctgaag cagaggctgt gacttggggc 6540

ctcagaacct tcacccctga gctgccaccc caggatctgg gttccctcct tggggggccc 6600

cagggaacaa gtcacctgtc ctttgcatag gggagccctt cagctatgtg cagaaggttc 6660 

tgctctgccc cttcctccct ctaggtgctc agctcctcca gcccactagt cagatgtgag 6720 

gctgccccag accctgggca gggtcatttc tgtccactga cctttgggat gggagatgag 6780 

ctcttggccc ctgagagtcc aagggctggt gtggtgaaac ccgcacaggg tggaagtggg 6840 

catccctgtc ccaggggagc ccccagggac tctggtcact gggcttgccg ctggcatgct 6900 

cagtcctcca gcacttactg acaccagcat ctactgacac caacatttac aaacaccgac 6960 

attgaccgac accgacattt accgacactg acatttacca acactgttta ccaacactga 7020 

catctactga cactggcatc taccaacact gacatttacc gacactgaca tttaccaaca 7080 

ctatttacca acactgacat ctactgacat tggcatctac caacaccaac atttaccgac 7140 

accaacattt accaacactg aaatttaccg acaccgacat ttaccgacac cgtttaccaa 7200 

caccgacgtt taccgacacc gacatttacc gacactgata tttaccaaca ctgacatcta 7260 

ctgacgctgg catctactga caccgatgcc agcatctacc aacaccgaca tttaccaaca 7320 

ctgacattta ctgacactga tatctactga cactggcatc tactgacacc aacatttacc 7380 

aacaccagca tctaccaaca ccgacattta ccaacaccag catttaccaa caccgatgtt 7440 

taccaacgcc gacgtttacc gacgccagca tctaccaaca ctgacattta ccgacaccga 7500

catttaccga cactgacatt tactgacact gacatctact gatactggca tctaccgaca 7560 

ctgatattta ccaacgccag catctactga cactgatgtt taccaacacc gacatttacg 7620 

agcaccgaca tttactgaca ccaatattta ctgacatcaa catttagcca tgtgatgggg 7680 

gccggcttgg gggcaggcct tgctcttggc actggggatg ctgcagagac cagacagact 7740

catggggtca tggacttctg cttcttctcc agcctcatgt actggacaga ctggggagag 7800

aaccctaaaa tcgagtgtgc caacttggat gggcaggagc ggcgtgtgct ggtcaatgcc 7860 

tccctcgggt ggcccaacgg cctggccctg gacctgcagg aggggaagct ctactgggga 7920

gacgccaaga cagacaagat cgaggtgagg ctcctgtgga catgtttgat ccaggaggcc 7980 

aggcccagcc accccctgca gccagatgta cgtattggcg aggcaccgat gggtgcctgt 8040

gctctgctat ttggccacat ggaatgcttg agaaaatagt tacaatactt tctgacaaaa 8100 

acgccttgag agggtagcgc tatacaacgt cctgtggtta cgtaagatgt tatcattcgg 8160 

ccaggtgcct gtagacacag ctacttggag actgaggtgg gaggatcgct ggagtccaag 8220 

agtttgaggc cagcccgggc aaaggggaca caggaatcct ctgcactgct tttgccactt 8280 

actgtgagat ttaaattatt tcacaataca aaattaagac aaaaagttaa tcacatatcc 8340 

actgccctgc ttaagacaga aaacatgggt gttgttgaag ccagaggcag ctgctggcct 8400 

gagtttggtg attggttcct aagcagttga aggcagtttt gtttttccat agatgtctgt 8460 

tctccctttg ctgggtgcag cctcgccctg ctgctgtggt cgggtttcag tggcctcgtc 8520 

ccgtggacgc agcctcgccc tgccgctgtg gtcgggtttc agtggcctcg tcccgtggac 8580

gcagcctcgc cctgctgctg tggtcgggtt tcagtggcct cgtcccgtgg acgcagcctc 8640 

gccctgccgc tgtggtcggg tttcagtggc ctcgtcccgt ggacgcagcc tcgccctgcc 8700 

gctgtggtcg ggtttcagtg gcctcgtccc atgggcgtgc tttggcagct ttttgctcac 8760 

ctgtggagcc tctcttgagc ttttttgttt gttgtttgtt tttgtttgat tttgtttgat 8820

tgtttgtttt tgttgtcgtt gttgttgccc aggctggagt gcagtggcgc gatctcagct 8880 

cactgaaacc tctgcctcct tgggttcatg ccattctcct gcctcagcct cccacatagc 8940 

tgggattaca agtgcccgcc accacgcctg gctaaatttt gtatttttag tagacagggg 9000 

gtttcaccat gttggtcagg ctggtctgga actcctggtc tcacatgatc cacctgcctc 9060 

ggcctcccaa agtgttggga ttacaggcgt gagccaccgc gcccagcctc tgttgagcat 9120

attttgaggt tctcttggtg ccagtgatat gtacatgtgt ccccatcgca ccatcgtcac 9180 

ccattgaggt gacattggtg cctctcctcg gggtggatgc ctccctctgt ttccagcaac 9240 

ttctgaagga ttttcctgag ctgcatcagt ccttgttgac gtcaccatcg gggtcacctt 9300

tgctctcctc agggctccca ggggaggccc gaatcaggca gcttgcaggg cagggcagga 9360 

tggagaacac gagtgtgtgt ctgtgttgca ggatttcaga ccctgcttct gagcgggagg 9420 

agtctcagca ccttcagggt ggggaaccca gggatggggg aggctgagtg gacgcccttc 9480

ccacgaaaac cctaggagct gcaggtgtgg ccatttcctg ctggagctcc ttgtaaatgt 9540

tttgtttttg gcaaggccca tgtttgcggg ccgctgagga tgatttgcct tcacgcatcc 9600

ccgctacccg tgggagcagg tcagggactc gcgtgtctgt ggcacaccag gcctgtgaca 9660 

ggcgttgttc catgtactgt ctcagcagtg gttttcttga gacagggtct cgctcgctca 9720

cccaggcgag agtgcagtgg cgcaatcacg gctcgctgta gcctcaatct ccctgggctc 9780 

aggtgatcct cctgcctcac cctctgagta gctgggacta cagacacata ccaccacacc 9840 

cagctagttt ttgtgtattt tttgtggggg gagatggggt ttcgctgtgg tgcccaagct 9900 

gatctcaaac tcctgaggca caagcgatcc acctgcctcg gcctcccaaa gtgctgggat 9960 

gacaggcatc agccgtcaca cgcagctcaa tgattttatt gtggtaaaat aaacatagca 10020

caaaattgat gattttaacc attttaaagt gaacagttca ggctgggcgt ggtggcttat 10080

gcttgtaatc ccagtacttt gagaggctga ggtgggcaga tcacctgagg tcaggagttt 10140 

gagaccagcc tggccaacat gatgaaatcc agtctctact aaaaatacaa aaattagccg 10200 

ggcatggtgg caggtgcctg taatcccagc tactcgggag gctgaggcag gagaatcgct 10260 

tgagcccggg aggtggaggt tgcagtgatc tgagatcatg ccactgcact ccaatctgtg 10320 

tgacagagca agactctgtc ttgaaaaata aataaataaa aaaaatttta aaaagtgaac 10380 

aattcagggc atttagtatg aggacaatgt ggtgcaggta tctctgctac tatctacttc 10440 

tagaacactt tcttctgccc tgaaggaaac cccatgccca ccggcactca cgcccattct 10500 

cccctctctc ccagcctctg tcaaccacta atctactttc tgtctctggg ggttcacttc 10560 

ttctggacgt tttgtgtgac tggaatcctg caatatgtgg tccctgcgtg tggcttcttt 10620 

ccatagcatt gtgttttcca gattcaccca cacattgtcg cacgttatca gaatctcatt 10680 

cctgactggg tgcagtgggt taggcctgta atcctaacat tctgggaggc caaggcggga 10740 

cgatcacttg aggcaggagt ttgagaccag cctggccagc ctagcaagac cccagctacc 10800 

aaaaaatttt aaaagttaac tgaacgtggt ggtggtgggc acttgtggtt cccagctacc 10860 

tgggaggctg aggtgggagg atcgcttaag cccaggaggt caaggctgca gtgagctatg 10920 

atcgcaccac tgcactccag cctggacaac agagcaagac cctgtctgaa aaaaaaaaca 10980 

aaaaaaaaag ttcctttctt tttgtggctg gatgacatcc cattgtatgg ccacagcaca 11040 

ttttgtttgt ctgtttatcg ggtggtgggc agtggtttcc accttttgtc tcctgtgaat 11100 

aatgctgctg tgaacatttg aattcaagtt tttgtttgaa cacctgttgt gaattatttg 11160 

gatatatgtg taggggtagg attgctgagt cctatggtaa tgttaggttt gacttactga 11220 

ggaaccatta aactgttttc aacagtggct gcgccgttct gcatccccac cggcagtgtg 11280

tgagggttct gactttacct cctcacaaac gcttcttttc catttaaaaa aatattcagc 11340 

caggtgctct ggctcacgcc tgtaatccca gcactttggg aggccgtggc gggcggatca 11400 

cctgaggtca ggagttcgag acgagcctgg ccaacatggt gtaaccccat ctctaccaaa 11460 

aatataaaaa ttagccgggt gtggcagcgg gcgcctgtaa tcccagctac ttgggaggct 11520

gaggcaggag aatcacttga acccgggagg cagaggttgc agtgagccaa gatcgcgcca 11580 

ctacactcca gcctgggtga caagagtgaa actccatcta aaataaaaca aaaataaaaa 11640 

taaataaaaa tttattaaaa cattcatcac agccagccta gtgggtgtcc catgtggctt 11700 

tgcctcgcat ttccctgata actaggatgc tgagcgtctt gtcccaggct tgccacacct 11760 

cagcactttg agatacgtcg cacagtcccc atttgcgaac gagaaatgag gtttagggaa 11820 

cagcagctgt gtcatgtcac acagcgagca gggggtctct gagccgtctg accccacagc 11880 

cgaccaagct ccaatcctta ccgcctccta gtgttgtgga tgtagcccag ggtgctccca 11940 

catttttcag atgagaacac cgaagctcaa aacaggagcg ttttgtccac attggataca 12000 

cgatgtctgt ggtttggtcc tgaagtcact ttatatctca gtggtccaga ctggagtagg 12060

acagggggtt ctggggaatg gggaaggtgt ctcaggtgaa aggaaggaat tccagattct 12120

ccatactgtc cttgggaagt tagaagactc agagggtctg gcaaagtcag acaaagcaag 12180

agaaatgcag tcaggaggaa gcggagctgt ccaggaacag gggggtcgca ggagctcacc 12240 

cccaggaact acacttgctg gggccttcgt gtcacaatga cgtgagcact gcgtgttgat 12300 

tacccacttt tttttttttt ttgaggtgga gtctcgctct cttgcccagt ctggagtgca 12360 

gtggcacgat ctcggctcac tgcaagctct gcctcccggg ttcatgccat tctcctgcct 12420

cagcctcccg cgtagctggg actacaggcg cctgccaccg cgcccggcta atttttgtat 12480 

ttttagtaga gatgggattt cactacatta gccaggatgg tctcgatctc ctgacctcat 12540

gatccgcccg tctcggcctc ccaaagtgct gggattacag gcgtgagcca ccgcgcccgg 12600 

cccgatttcc cactttaaga atctgtctgt acatcctcaa agccctatac acagtgctgg 12660 

gttgctatag ggaatatgag gcttacaggc catggtgctg gacacacaga agggacggag 12720 

gtcaggaggt agaagggcgg agagagggaa caggcggagg tcacatcctt ggctttcaaa 12780

atgggccagg gagagacacc ctctgagcat ggtaggacag gaaagcaaga ttggaacaca 12840

ttgagagcaa ccgaggtggc tgggcgtggt ggcttacgcc tgtaatccca acactttgga 12900

aagctgaggt gggtggattg cttgaggcca ggagttcaag accagcctgg ccaacatggt 12960

gagaccccgt ctctactaaa tatacaaaaa ttagccaggc gtgatggtgc atacctgtaa 13020

tcccagctgc ttgggaggct gaggcaggag aattgcttaa acctgggagg cggaggttgc 13080

agtgagccga gatcccgcca ctgcactcca gcctgggcca cagagtgaga ctccatctca 13140

aaaaaaaaaa aaaaaaaaga taaaaagacc aaccgaggaa ttgaagtggg ggggcgtcac 13200

agtagcagaa gggggatcgt ggagcaggcc accctgtggt catgcactgg aagctcatta 13260

cctgacgatt tggagctcat cactgggggc ctaaggagaa tagatactga aggatgagga 13320

gtgatggcgc ggggcacggg tgtctttggt ggccagaact tggggactgc tggggtgcct 13380

cactgcaggc cttctcagcg ccctttatat gcttacacag gctgtttcta agagggggat 13440

acattgcata agcgttttca gactacctca tcatgggtcc ctttctttac cctctgtggc 13500

cctggtggcg cactctctgg gaaggtgcag gtggatgccc agacccgccc tgccatccac 13560

ctgcacgtcc agagctgact tagcctcgag attgctgctg gcacctcctg ccccgggaca 13620

cctcggatgt gcccgtggag atgctggctc tgtgttttct gctggagttt ggtgcgtctt 13680

ttcctcctgc aagtggccac cgctcttggg tatgtcctca ggcttctgcg agtcatggct 13740

gcttctcagg tccttgccca gcgccaggag caaaccctcc tggcactttg ttcaggggtg 13800

gatgcgccag tgttcctgct gtggaccccc atctcacatg agggtcttgg gcctgcaggc 13860

tcgttcagga aacacccgct gagtacgcag tgtgtgccag ctgtgtccca ggcaatggcg 13920

gggacagtgg ctgctgctgg ggttgtggtg gcttctgggg actctgggga cagctgaggt 13980

gcaaggagcc acggctcctt gaggatgcag ttggactcca ggtggaaggg atggttgggg 14040

gaggtataaa tggggtcagg gaggagacac atttggaaca atgggaacat ttttaagatg 14100

ctatgtcggg aggcaacaag gtggccaacc caggtgctga ggagcccaca ccagccctgg 14160

acgtgttttg ccgctcacct ttgctgggga gtggtgggag agaggattcc gttccacgtg 14220

gtggtgtgcg cagctgggct gtgtggagct gggcgctagg aggaaggtgc tttctgcggg 14280

gctagccggg ctctgccttt gaacacaatc aggctccagg ttttcagcat ccagtgcatg 14340

agaggacttc acgggcagct gtggctgatc ccttgatgaa ttgggagaag aacaaaggtc 14400

tatgaaatga ggtttcatgt agatggcatt agagacgccc acaacagatt tacagagtgg 14460

agcggagacg gcggatgggt ctgggaggcc cctcctgctg gccttgactg tgacagctgt 14520

cctgggaatc agcttccagg ccgccccagc agcctgactg acacacacag gggttttagc 14580

cccatcctgc gaccagctgt tgccatcatc agtgacagct gggagtggcg gtggttccag 14640

ccctgggcac cctccccacc tgctggggcc cacccagggc agtcctgaca cctacaggtt 14700

gcttggagcc gcatccgagt cctgccccac cacgtgtgaa gcccgagtgg tcgtgggctg 14760

aggtcccctg attgcatccc cacttccctt ctgcttcaca tagctgcctc ttctcaccgt 14820

ttttccagcc tcctgggcta ggaattccag tgttgtgctg gctttgcccc aggacacctc 14880

cttagccctc ttcctgagtc tagagccccg ggggttggaa gttctggccc ctgggacacc 14940

tgcagccaca ctcagcttct cctgtgagcc tccagcatgt cccctcagga ccaagccctc 15000

acgttcttgc ctccccgccc acctgggctc agccagggga aggcctggct gggagcgtct 15060

cccctctgcc ctgcccttct cccctctacc ctgcccttct ctcctctgcc ccgccatggc 15120

ttttatatcc tgtgccacaa gacatggctg tgtgtgaaag tggcagggtc tggcatctct 15180

gtgggtctct gaggcccacg ctccagtgcc actcttccca cccgctggcc gtgccctcat 15240

gctggaggga cagcccagcc ctctcccgaa ccccagcccc atgtgcccag ctgcccccgg 15300

ccctctcccc tggaagccgg ggtcactcca gccgtatgcc atggtgggga catcctgctt 15360

ccttggcctt ccagggaagg tcctctttcc aaatggcgac acctggtccc tgcctggagg 15420

ctggaagctg tggcccttgt atgcccctcc agggtctgtg cgctcggttg gcccgagttc 15480

ccatcaccgt catcatcacc atcatcattg tcatttcgct tgtctgtgag ccggcctggt 15540

ctcccagagc agagaccctc tgaggtccag cctgagttgg ggtctccgtg ctgacccctg 15600

acggggactc aggacgtacc aggtctgggt caggagtgac ccccaaacct cgtgcccttt 15660

gacaggcacc cctgactttt gctaagtggg tggaggtgac atcacttaca gcgggagtga 15720

tgggacaggg tctgttggct gcactgtgct cccagggatc tggggagagg ctatatccct 15780

gggctttggc actgcagagc tgtgtgtgtt tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 15840

tgtgtgtgtg tgtgtgtgtg tttgcgtgcg cgcacatgtg tataagatct ttttttatta 15900

catgaagcaa gataactgtt gctgtttcct tttgggtttt gtgttcaaca gagtggggta 15960

cttcttccct cagacaacag aactctcccc tttaaacacg tgctgtcaga gggtgggtct 16020

tgggctcatg tctgtttgca cagccgagtc agaggaaaca cagggttctt cataaaaaca 16080

ctgcacagca ggcgactgtc cagagtcagc ctgcaggacg gcagcagccc tgcccctcag 16140

agcacagcta gggtgggctg ctttgggatc tcccgtcatt ccctcccagc tggcagccgg 16200 

cggccggccc attccttggt gtgctggtca ggggggcgtg cgcctgctct gctcaccctg 16260 

ggaatgggac agaagctggc agctcggaga ggacagggct ggacccttgg gtggcctctg 16320

gctggaccat ctcattgtcc tcagacacag cctctcgggt ctagtttcat ttcctgaaaa 16380 

acaagtgcac agaactagag caggagtcga gagctacggc ccccgggcca gatccagccc 16440 

tgccacctgt tttcacacca tgctcaagct gagtgggttt tacatttttt aattacttga 16500 

aaaaaaaaaa gccaaaggag gtttcatgac ccatgaaaat tatatggaat tcaaaaaaaa 16560 

aaaattatat ggaattcaaa tttcagtgtc cataaataat ttcttgagac agggtctcgc 16620

tctgtcaccc aggctggagt gcagtgctat ggcatggctc gctgtaccct tgacctccca 16680

ggctcaagcg atcctcctgt ctcagcctcc tgagtagctg ggactacggg tgtgtgccac 16740 

caagcccggc taattttttt ttaattttag taaagacagg gtctttctat gttgcccagg 16800 

cttttctgga actccatctt ggcctcccaa agtgctggga ttacaggctc gagccacgga 16860 

gcccagcctg tttttgtttt ttcactgata aagttttgcc gggtgtggta gtgtgtgcct 16920 

ctagcgattt gggaggctga ggtgggagga tcgcttaagc ccaggagttt gaggctgggc 16980

tcaagtgatc aggaggtgaa ctatgatcat gtcattgcat tccagcctgg gtgacagagc 17040

aagaacctat ctcttaaaaa tatatattta aaaagtattg ggtgtggtgg ctcacgcctg 17100 

tggtcccagc tacttaggca tctgaggtgg gaggatggct tgagcccagg agtttgaggt 17160 

tgcagcgagc caagatcgtg tcactacact ctagcctggg tgacagagcc cagaccctgc 17220

ctctttaaaa aaaaaaacca aaaaacatgt attggaacac agccatgcct gttcagtcac 17280 

gtgctctcca tgctgctttc tgctccagag acccttatgg cctgaaagct gaaaatattt 17340 

tctatccttt acaaaaaagt ttgctgacct ctgtcctgga aaattcatct cccaagttct 17400 

cttccggcac tggcgttcct gggtgtccta aatttggccc ctgttatttc tgaactctgt 17460 

tttggctctg ttccctccca ggagccagga caggcacgtt ctctgcatct tgtcccctga 17520

cgcccagagg cttggctcgg ctcaggcatt cttggaaata tctggctcca ggaaaggcag 17580 

aggcctcctg agtcagccca gagggaacct gccccaggtc tgggggaggc ctgacccagc 17640 

agagtggctt ttgccgatgg gttgggccgg tcaagatgtg ctgaaagttg tcctcagaag 17700 

gccactttgg gattccttcc tccagtatta gagcaactga gagctgctca ttgcaagcct 17760 

gatgttttcc cagttggccg ggtccaccgg gtgccctggg attctgggat ctgggtggaa 17820

agtagggggc ttgggggagt gtcctgggtt ctggaatcca ggtggcaagt ggtgaggttc 17880 

agggagtggc ttctgagcca ccataggggt ctctgtggga ggctctgccc atccaggaga 17940 

ttccgcaggc cctgccggcc cagagccagc gtcttgcgct tgccgaggct acagccagcc 18000 

ccagccgggt ggaacagccc gtcgcctcct ctcactttgt tttggggcca cctgggagtg 18060

tggagcaagg gtagagaggg aggaagtggc tgccggccgc tgcccagcac ccttgtttgc 18120 

cttgggccct ctgtgggctc ctttttattg ctcttcaatg aagccaggga aatggacttc 18180 

cttgcctcac ttcagttcaa catgtctgga agtttggtat taaaattaag aaagtgtgga 18240 

aatagagcaa gaagagaaaa atctctccaa gagataatag tgacctctga gctgggcgcg 18300

gtggctcacg cctgtaaatc ccagtacttt gggaggctga ggcgggcaga tcacctgagg 18360

tcgggagttt gtgaccggcc tgaccaagat ggagaaaccc cgtctctact aaaaataaat 18420

aaataaataa ataaataaat acaaaattag ccaggcatgg tggcgcctgc ctataatccc 18480 

agctaaggca ggagaatcgc ttgaacctgg gaggcaaagg ttgcagtgag ccaagatcac 18540 

gccattgcac tctagtctgg gcaacaagag tgaaactccg tctcaaaaaa aataaataaa 18600 

taaaaaataa aaatagtgac ctctggccag gtgtggcagc tcatacccgt aatcccagca 18660 

ctttggaagg aaggccgaga tgggcagatt gctttagcac aggagtttga gaccagcctg 18720

gccaacatgg tggaacccca tctctacaaa aatagaataa aatttaagag gtaatagtga 18780 

ccttttggta gatcgaaacc tggattgctt tctttttcta aatgctgatt cttttctttg 18840

tggtgtttgt gttctgtgcc gatgtccctc ccccagccct gttattgtga gtggaagaag 18900 

gggaaagggt tcgcccgcta ctgtgagccc ctcctctcac gctgggtgtc cttggagaag 18960 

cctgcacttc ttcattgtac gccagggctg ggtccctccc tggagtggtt ctgtgctgct 19020 

gggatggggc caacccctca gatgttttct gagtgtcaca cacaggtgtg tgcattcatg 19080

gcctttgcgt gtcttcctgt tgtggaggca aaaatgtgaa gaaccctaga tgattttggg 19140 

accagggctc catcacctgc tgttcattgc acaccggagc atccaggcat gggtggagag 19200

ctcagacttc caggcacggt cgcaggggct ggtctaacca tgttcccgcc cgcctgctcg 19260

tcagaaccgc ctgttgggag ctgttatcat gataccatac ctgggccctg ggctatccga 19320

ttctgactta attgctccag gttggggcca ggccgttgtt tgctgttttg ttgtttcttc 19380

tgtgacgtta gccactgggc taatctgagc ccctcagtta caggtggaga aactgagacc 19440

catgggggtg caaggacttg ccgaggaccc agagcccctt gggggcagag ctgaggcggg 19500 

gcctggcttt gggtcccaga gcttccagtc cccttcccgc tctcctaaca gctttttttt 19560

ttgagacaag atctcaccct gtcacccagg ctggagtgca atggcatgat ctcggctcac 19620 

tgcaatcttc gctagctgcg ttccagcgat tctcctgcct cagcctcccg agcagctggg 19680 

attacaggtg tgtgccgcca tgcccagctc gttttttttt gtacttttag tagagatagg 19740 

gtttcaccat gttggccagg ctgatctcga actcctgacc tcaaatgatc cgcctgcctc 19800 

ggcctcccaa agtgctagga ttacaggctg ggatcacact gtgcctggcc ctagcagctt 19860 

tgtcctgtgc catccaacaa cagatgaccg aagtctttgt ttcttaacat gcattccatc 19920 

tgccttacag ttttgccacc tgcaaaacag aggacttgtc gcttttctgg taagctggaa 19980 

atgtaatctg gtagcaggag gcctgtggaa gcttgccttt aatggccttg tgtctctttc 20040 

atcctgtcct gagagccgga gaacttggat gttgcaccta actcaacctt cctgttaaca 20100 

tacagttctg caggctcatg gatcatcaga accacgtcct atctcacgcg gctgtatgct 20160 

tccgttggtt caggtgtttt taccttgaca gtattttctc ctcggtggct tttgcggtgg 20220 

ttgcttttaa tcagcattga ctcttcaaga aaaatattta gctgctacat ctcagaggag 20280 

acagggtgga aagcatctga gacctgcagg ctcagactta gaaccagaag tgccctcaga 20340 

gttcatccgg ccctgaccca gcgggaaatg agttcacaga gaagcgggag aactttgccc 20400

caggccctgc cgttgctcat aactgcccca ggtccttaca tttgctccag gtcctgcccc 20460 

aggccctgca gttgctcata actgccccag gtccttatat ttgctccagg tcctgcccca 20520 

ggtcctgcag ttgctctgtg tggtgggtgt gatctggagc cctccgccca ttgctgcacc 20580 

tggggcaggc attgctaatt gatcccagga ctccttcctg cggagcacgc cctggttctc 20640 

caggcagccg ctgcctgtca gcctgcagtg gttcgggaga ggacacctgc ttgcctggtc 20700 

tgttccaaat cttgcttctc atcccagcac aggtaggggg tgctatggga aagggatcct 20760 

cagttggccc tgtcactgct ctatcagctg gggacgtggc atcctagtga aaacatcatg 20820 

gccgggcgcg gtggctcacg cctggaatcc cagcactttg ggaggctgag gagggtggat 20880 

cacttgaggt cagaagttcg agaccagcct ggtcaacatg gtgaaaccca tctctactaa 20940 

aaatacaaaa attcgccagg tgtggtggcg ggtacctgta atccgagcta ctcgggaggc 21000 

tgaggcagga gaatcgcttg aacctgggag gtggagcttg cagtgagccg agatcttgcc 21060 

actgcactcc agcctgggca acagagtgag acgctgtctc aaaatctcaa acaaacaaac 21120

aaacaaaaaa caaacaaaca aagcgtcatt tatccagcac ccctggggaa ccatgctacc 21180 

tggtgtttta tggtacctgg caaggtgcag gtgaagttgc tgctcttggg cattgaaccc 21240 

gtcttgtttg gggcagctca ggccccaggc agggtccggg ttggctctcg ttggtgtggc 21300 

cctggcccat ccagacctat atttctgccg tcctgcaggt gatcaatgtt gatgggacga 21360

agaggcggac cctcctggag gacaagctcc cgcacatttt cgggttcacg ctgctggggg 21420 

acttcatcta ctggactgac tggcagcgcc gcagcatcga gcgggtgcac aaggtcaagg 21480

ccagccggga cgtcatcatt gaccagctgc ccgacctgat ggggctcaaa gctgtgaatg 21540 

tggccaaggt cgtcggtgag tccggggggt cccaagccat ggctcagcca tgcagacttg 21600 

catgaggagg aagtgacggg tccatgcctg ggcataagtg ttgagctcag gtgccccgac 21660 

ctggggaagg gcaggacagg aaaggtgaca gtatctggcc aaggacagat gggaagggac 21720 

caagggagct gattagggag tggttatgga ctaggaatgt cggtaacaat ggttagaaag 21780

tgactaacat ttgttgagca cctgctgtgt gcccggccct ggccgggagc cttcgtgccc 21840 

acagtgaccc cgtctgcaaa tgtagttcct tgccctactc gcactgggga gcaggacgca 21900

gagccgtgca tctcacaggt gccaagctca ggactccctc ctgggtctgc ctgggctggg 21960 

ctgtgcttgt tgcccctgtg gcccacgcat gtgcaccttc cacctgaaag ccaggatctt 22020 

caggacgctc cccgaggagg tcgttgtctg gcacaatgat ttgtctcttc ctgaaaaggt 22080 

gacagagtta cactggagag agcagcatcc aggtgcggca gggacaggcc tggggctcgc 22140 

gggcagggac tctgtgtcct gccggggtcc cacactgcac ctgcttgtca gaggcactca 22200 

gtcaatcttt gctgatgaag gatgagagga cagaggacgt gatgcttgct gctgcattgc 22260 

ctgcagtcct gggtgagatg cccgggttga ctctgctgcc cgtcgggtgg atgtgatgtc 22320 

agatccccgg ctttaaaata cgagggagct gggaattgag ggagcaggtt ggggcagaaa 22380

gcacagcccc gtggaagcct ggagctgagg cagtgtgggc gacccctgga gcagtgagtg 22440 

cttccttcat ggccttcatc gcaccctgca gtcctcatgt aggggatgcc atccatgaat 22500

ttagttttcc cagcctcctt taaaaacgcg ttcatgctgg ggccggggca gtgcagtggc 22560

tcacatctga aatcccacca ctttgggagg ccgaggcggg tggatcatga ggtcaggaga 22620 

tcgagaccat cctggctaac aaggtgaaac cccgtctcta ctaaaaatac aaaaaattag 22680 

ccgggtgcgg tggcgggcgc ctgtagtccc agctactcgg gaggctgagg caggagaatg 22740 

gcgtgaaccc gggaagcgga gcttgcagtg agccgagatt gcgccactgc agtccgcagt 22800

ccggcctggg cgacagagcg agactccgtc tcaaaaaaaa aaaaaaaagt acaaaaaaaa 22860

aaaaattagt ctgggtgtgg tatcacgcgc ctataatctc actactcgag aggctgaggc 22920

ggagaattgc ttgaacccag gaggtagagg ttgtagtgag cccgtatcgt accactgccc 22980 

tccacctggg caatagagcg agactctgtc tcaaaaagaa aaaaaaaaaa agaacattta 23040

tgccaggtgt ggtggctcat gcctgaaatc ccagaacttt ggaagactga ggcaggagga 23100 

tcacttgagc ccagaaattt gagagtgtct tccctgggca acatagagag acctcatctc 23160 

taccagaaaa aaaaaaatta gcccggcatg gtggcatatc cctgtggtcc cagctactta 23220 

gggggctgac gtggcaggat cacctgagtc tggaggcaga ggttgaagtg agctgagatc 23280 

atgccactgc actccagcct gggtgacaga cagagaccct gtctcaaaaa aaaaaaaaaa 23340 

aaaaagcatt tactatccac catggaaggt gagactgacc tgtgagtgat tgttcaaaga 23400 

acaaaaaata aaccccagag ataagacaaa agggtgcctc catgggggtg tgatttaaag 23460 

ctgagaaatt gggcttcttc cccctcccct ctcaccccgt ggtttgctaa aggagatggg 23520

aaaaaggatt ctttttttgg ctgaaatatt taacactaaa ttaaagccaa ttttaacagc 23580 

actttggttg atgagtgaaa ttaacagact ggccaaaaat aaacgaacgg tctgtactat 23640

gtgaaaaaga ggcagctttg gccatgctgg gccaatgtga gttttcaggg ttgctgggaa 23700

tgtctgtgaa tcggaggaag ggcctagctg ggactctcag gagccaaggc cctgaggggc 23760

aacttgcctg gtccctgccc tgaggcgttc actgctttct tcctgggcca gatcacaggc 23820

ccggaggctg gaccactggg ctggcactct tgccgagctg ctccctgact tcctgaccat 23880

gctcctttca gcagccttgc tgcactttag tttccttgaa tgaaaaatgg ggatgagaat 23940 

agctcctacc tccaaggtga atggagtgag ttcggacagg tgactccctg ggaccagtgc 24000

ctggcgcctg acaaggtcca gtcagagccc gcactgctgt tactgatacc cttggctgta 24060

ccaggggaga acttggttgc cattgccagg tgttctccca ccacccccac tactgtccct 24120 

gtttgatgtg tggcgggaat aaagctgtgc acattggagc ttttggcaca tcctggcttt 24180 

caggtgaaag gtgcgtgtgt gtttgagggt ttagcctggc caacccagcc atgaggtcgg 24240 

acctgacctg ggggtgagtc ctgagctcgg cacccctgag ctgtgtggct cacggcagca 24300 

ttcattgtgt ggcttggccg cacccctttc cctgctgggc tgttgatgtt tagactggag 24360

cctctgtgtt cgcttccagg aaccaacccg tgtgcggaca ggaacggggg gtgcagccac 24420 

ctgtgcttct tcacacccca cgcaacccgg tgtggctgcc ccatcggcct ggagctgctg 24480 

agtgacatga agacctgcat cgtgcctgag gccttcttgg tcttcaccag cagagccgcc 24540

atccacagga tctccctcga gaccaataac aacgacgtgg ccatcccgct cacgggcgtc 24600 

aaggaggcct cagccctgga ctttgatgtg tccaacaacc acatctactg gacagacgtc 24660 

agcctgaagg tagcgtgggc cagaacgtgc acacaggcag cctttatggg aaaaccttgc 24720 

ctctgttcct gcctcaaagg cttcagacac ttttcttaaa gcactatcgt atttattgta 24780 

acgcagttca agctaatcaa atatgagcaa gcctatttaa aaaaaaaaaa gatgattata 24840

atgagcaagt ccggtagaca cacataaggg cttttgtgaa atgcttgtgt gaatgtgaaa 24900 

tatttgttgt ccgttgagct tgacttcaga caccccaccc actcccttgt cggtgcccgt 24960 

ttgctcagca gactctttct tcatttatag tgcaaatgta aacatccagg acaaatacag 25020

gaagactttt tttttttttt tttgagacag agtcttactc tgttgcccag gctggagtac 25080 

cgtagcgtga gctcagctca ctgcaacctc cgcctcccag gttcaagcga ttcttctgcc 25140 

tcagcctcct gagtagctgg gactacagac atgcaccacc acacccagct aatttttttt 25200 

atatttttag tagagacagg gtttcatcat gttggccagg ctggtcttga actcctgacc 25260 

tcaggtgatc tgcccgcctc ggcctcccaa agtgctgaga taacaggtgt gagccaccgt 25320 

tcccggcata ggaaaacttt ttgccttcta aagaagagtt tagcaaacta gtctgtgggc 25380 

tggccttctg attctgtaaa gaaagtttga ttggtggctg ggtgcggtgg ctcacacctg 25440 

taatcccatc actttgggag gccgacgtgg gcatatcacc tgatgtcggg acttcgagac 25500 

cagcctcacc aacgtggaga aaccccgtct ctactaaaaa tacaaaaaaa aaattaaccg 25560 

ggcatggcgg cgcctgcctg taatcgcagc tactcaggag gctgaagcag gagaattgct 25620 

tgaacctggg aggcggaggt tgtggtgagc tgagatggca ccattgcact ccagcctggg 25680 

caacaaaagt gaaactccgt ctcagaaaaa aaaaagtttg attggtgtaa ccaaagcgca 25740 
tttgtttatg gattgtctgt ggcagctttt gttctgccga gatgagttgt gacagatctg 25800

tatgggctct aaagcctaaa acatgtgcca tccgcccctt tacagaaaaa gtgtgctgac 25860 

ctctgttcta aagtattgga caactacaat gtttgctcat ttattattct atgatttgtt 25920 

ttctgctttt tgttgttgtt gttgttgttg agatagggtt tccctctgtc actcaggctg 25980 

gagtgcagtg gtgtaatctc agctcactgc agcctcgacc tcctgggctc tagtgatcct 26040 

ctcatctcag cctccctagt agctgggact acaggcacac accaccactc ctggctgatt 26100 

tttttttttt tttttttttt ttgtggagac agggtttccg catgttgccc aggctggttt 26160 

caaactccta ggctcaaaca cccacctcag cctcccaaag tgctgggatt acaggcgtga 26220 

gccaccatgc ccagcctatt ctactgtttg tattacatag ctttaaaaga ttttttatga 26280 

ctttaagtca caagggttct ttgtagaaaa aaatatatat ataggaaagt ataaaaagaa 26340

agtaaaaatt gtccataacc tctccagcca gagacgaccg ttgctgacac ctcagcatat 26400 

tgcctttaag tcttttttct ctaagatagc atttctcttc atcacagtca tatgctacgc 26460 

agaattctgt atcctgattt tttcacttga cattacaaca ggtatttgat ggcgctgtga 26520 

caaactcttt ggcacaatct tttaaatgta tgaaatactc cactgcacag atgtttgctt 26580 

ttaggcttaa ctgttctttt attttgcgtg tgctggttac agccgggcac agtggctcat 26640 

gcctgtaatc acaacacttt gagagggtga ggcaggagga tcacttgagc ccagaagttt 26700 

gagaccggcc tgggcaacat agtgagaccc catctctaca aaaaactttt ttaataagtc 26760 

gggcgtagtg gtgcatagct gtagtcccag ccaccaagga ggctgagttg ggaggattgc 26820 

ttgagcccca ggaggttgat gctgcagtga cctgagatta ctccactgta ctccaacctg 26880 

agcgacagag caagacttgt ctggggaaaa aaaaaaaaaa aatatatata tatatatata 26940 

tatatataca tatatacata cacgcacaca cacataatat aaaaatatat atttataaat 27000 

atataatata taatataaaa atatatattt ataaataaaa tttataaatt atatttataa 27060 

gtaaatatat aatatataat ataaaaatat atattatata atatataata aaatatataa 27120

tataaaaata tatatttata aataatatat aatacatact tataagtata tatttaaaat 27180 

atatgtaatg tatatttttt aatgtatgat atataatata catttataaa tacacattta 27240 

tattatttta tataaaatat atataaaatc tccaagttgc tttttccaaa aaggtgtctt 27300

gctgcatttc aaacattcat ttaaaaactt gaatgctggt gatctggtcc agaatgtgtt 27360 

cagtagctgc tgccagtggc caagcatctc gggagatgtc tacaaaacac gctggttctg 27420 

gcctggcgtg gtggctcacg cctgtaatct cagcactttg ggaggctgag gcaggtggat 27480 

caactgaggt ctggatttcg agaccagcct tgccagcttg gtgaaacccc atctctacta 27540 

agaatacaaa aaaattagcc aggcgtggtg gcatgtgcct gtaatcccac ctacttggga 27600

ggctaaggct ggagaatcgc ttgaacccag ggggcagagg ttgcagtgag ccgagatcgc 27660 

accattgcac tccaggctgg gcaagaagag cgaaactccg tctcaaaaaa aaaaaaaaag 27720 

atgctggttc ctaaaatgtg gcccttttcc tcctcacctg ctgccagacc atcagccgcg 27780 

ccttcatgaa cgggagctcg gtggagcacg tggtggagtt tggccttgac taccccgagg 27840 

gcatggccgt tgactggatg ggcaagaacc tctactgggc cgacactggg accaacagaa 27900 

tcgaagtggc gcggctggac gggcagttcc ggcaagtcct cgtgtggagg gacttggaca 27960 

acccgaggtc gctggccctg gatcccacca aggggtaagt gtttgcctgt cccgtgcgtc 28020 

cttgtgttca cctcgtatga gacagtgcgg gggtgccaac tgggcaaggt ggcaggctgt 28080

ccgtgtggcc ctcagtgatt agagctgtac tgatgtcatt agccttgatg gtggccagga 28140 

ctggtagggc cctcagaggt catggagttc cttcgtggag cgggtgctga ggctgtatca 28200

ggcacagtgc tggctgcttt cacctgggcc gtctcaccga agtgtccatg gagcctgcgt 28260 

agggtgggta tctgtgtcga ttttacagat gcagaaacag gctcagagaa accgagtgac 28320 

ttccctaagg tcacataccc agttagagca gagctgggcc aggaagtgct gtctcaggct 28380

cctgaccagg tctccttgct ttgcactctt gccaaaacca tgatccagaa ctgactttga 28440 

ggtccccgga cctcaggctc ctccgaaatg gcctcttgga ggctgctgag ccacagctta 28500

ggacccacct cgagaggcaa atgtgctttg agctgccagg cgtcctgggg gccctgcctt 28560 

gggcacgggg ttcagacagg ccccagatgt gtggggcgtc tttctggact tgagttttct 28620 

tttctgtgtg gtggacacag tgctcacccc ttaaagcacc tgtgatgtgt gcagcagccc 28680 

aatccctgcc tgtcgcctgt tctgctaggg aaggaaggaa gacttcagga tggcaggaca 28740 

acagaaagag gtccaggttt tagagcaagg gcaggtcaaa cttagaaaat tctggaatga 28800

ggatgtgcat ttcctcttct ggatctgcta aaagaagagg gaaggagggg ctgctggggg 28860 

aggagcccag agccgagttt acatccggat cccgcaaggc ctcccctgcc ctgaggtctt 28920 

gttttgtgat gtgcttgtgt ccatcctggt ttctgccgtg tccccaacat ccggccaagc 28980 
ttaggtggat gttccagcac acactcaccc tgtctgtgca cctgtttttg tgtccgtaag 29040

tgggtattta ctcaccttac gagtgagcca ctgtgggaat tcagggaggt ggcgcagtga 29100

ccacccctgg agggatatgt gtgtggcagg ggtcgagggt ctcgcccttc cctgcttcct 29160

gcgcgtggct ttctccagga cggggagggc tgagctgaag aggtggggac agttgcgtcc 29220 

ccccgccacc cactgtcctg cggtgagagc agactcactg agcctgccct tctcccttgt 29280 

gccttccagc tacatctact ggaccgagtg gggcggcaag ccgaggatcg tgcgggcctt 29340

catggacggg accaactgca tgacgctggt ggacaaggtg ggccgggcca acgacctcac 29400

cattgactac gctgaccagc gcctctactg gaccgacctg gacaccaaca tgatcgagtc 29460

gtccaacatg ctgggtgagg gccgggctgg ggccttctgg tcatggaggg cggggcagcc 29520

gggcgttggc cacctcccag cctcgccgca cgtaccctgt ggcctgcaag ttccccaacc 29580 

tggcaggagc tgtggccaca cccacgactg cccagcagcc tcaccctctg ctgtgggagt 29640 

tgtccccgtc cacccctggg tgcctttgct gcagttatgt cgggagaggc tctggtgaca 29700 

gctgtttcct gtgcacctgc tgggcactag gtcccagcta atccctgtgc caggactcta 29760 

atttcaccct aacacacatg gtggttttca ttgctgggga agctgaggcc tgagcacatg 29820 

acttgcctta ggtcacatag ctggtgagtt caggatcccc cagagatacc agggccagca 29880

ctcgatcccc acccagccct gaaccccacc atgtgctggg attgtgctgg gagtgtccac 29940

acgcctggga ccccagggct ggtgctctca tctccttttt ccagatcatg agaatgaggc 30000 

tcagggaagt ttgaaaaaaa cctatcccaa gtcacacagc aacaggagca ggatttgaac 30060

ccagaaaagg ggaccgcaca ctctgttctg ctagagtagt tagctgtcct gggtgatatg 30120

gcaggtgaca ggggcaactg tgcttaacaa aggaaccccc atcccccctg ccaagttggg 30180 

agactagaag gtcaggggca gaagctctga agggccaggt gcagtggctg acacctctaa 30240

tcccagcact ttgtgaggcc aaggcgggca gatgatttga gcccaggagt tcaagatcag 30300 

cctgggtaat gtagtgagac gccatctcta caaaaaaatt ttttaaaaat tagctgggca 30360

tggtggttca tgcctgtagt ccaagctact tgggaggctc aggtgggagg attgcttgag 30420

cccaggaggt tgaggttgtg gtgagctgtg atcatgccac tgcactccag cctgggcaat 30480 

agagtgagac cgtctccaaa aaaaaaaaaa gaagaagaaa aagaagctct gaggctccaa 30540

gtccccaggc accccttggc ttgagggcag acaagggagg agagggtcac ctgggcagcc 30600 

ctgacttttg tcccctggca aagggacctt cagtgacctt ggccctagga gagcctctga 30660

gcacgtcagc catgtcgaac cgctcaggaa gggcagcaag aatttggctt ctgacctctg 30720

cctctcctac tcgccatctg cactgggtgt ggttgtgccc attttacaga tgaggaggct 30780 

ggggcatcga ccagctgaat gccttgtccc aggtactgcg taggcagagc tggcagttga 30840

accccgtgtc ctggttgtcg ctgggggtgg gctgcaccct gacttgtgag gccagtagca 30900

aggtttgcac gtgacttcgt gaccgtcacc cagctctgca gcacatcccg tgacccagct 30960 

catccaggcc gcatgcaaac ctgttgccag gcgagaaacc agtcaccgca cagctgtggt 31020

tgcctgaaat gattaagctc attaatcacc ccggagtgag gacagactca gatgaaaacc 31080

agcaaaagcc ctggaaactc atgtgaccct gccaatgagg gcggccatgt gcattgcagc 31140

ctggccgtca ctcctcggta cgtgttttgg acttaaacgc tccggatgtt tactgagtgc 31200 

ttgattaata acatggaagg cctggtctca ttgctgtggg agtgaaggat gcacagccag 31260

gcctgacatg atgagaacaa gaacctggag tctcgctgcc tgggtggtaa tcctggccct 31320

gccacttagc aactgtgtga ctgtagccag gtcacttaat tttgctagat cctgcctgcg 31380 

cttcagtgga tcttgctggt tttccaaggt ggccaaacac tttaaggcat tcatgtggtc 31440

gctaggctgc agggttgaac cctggctcac cccgcagggc gccgtgtgct ctgtggcctg 31500

gctgtgcctt tgctgacacc gtgcccgtgt gtgttcatgc aggtcaggag cgggtcgtga 31560 

ttgccgacga tctcccgcac ccgttcggtc tgacgcagta cagcgattat atctactgga 31620

cagactggaa tctgcacagc attgagcggg ccgacaagac tagcggccgg aaccgcaccc 31680 

tcatccaggg ccacctggac ttcgtgatgg acatcctggt gttccactcc tcccgccagg 31740

atggcctcaa tgactgtatg cacaacaacg ggcagtgtgg gcagctgtgc cttgccatcc 31800 

ccggcggcca ccgctgcggc tgcgcctcac actacaccct ggaccccagc agccgcaact 31860 

gcagccgtaa gtgcctcatg gtcccccgca cctcactccc tcgttagatc aggctggttc 31920

tgggagctga cgctgaaagg agcttctcat ctggggttcc tgggtgtaca tagatggttg 31980 

ggtaggttgt gcactgcaca agctgcatga tgctacctgg gggtccaggt ccaggctgga 32040 

tggacttgtt gcttcatcag gacatagata aatggccaaa actcctcagc tggaaggtcc 32100 

tgggcaggat ctttgggtgt gaaaaccagt cacaggggaa gggtgcttgc tcatactgcc 32160 

agcacagtgc tgagtgcttt ccatagcgct cgtttactcc tcaagcctgg agggtgggga 32220
gtagcatggt cccatttcac gtacaaggaa cccgatgcac agagaggtgt ggcaacccat 32280

ccaaggccat acaactgggg tgggttgagc cggggttgac tgtggcaggc tggctcaaga 32340 

gtccctgctc ctgaaccctt gccaggcagc ctggcatcag ctcggggaat ttttgccctg 32400 

acccttggaa gcaagtgggc ctctttgttc tcatgtcagt gatgagaaga gtgactttcc 32460 

tatggcccct ctggagtaca ggtgtttcct gttggcgggc tcttccccca tgacatcagc 32520

agcgagctgg ttatgattcc ctacgcagaa cttgatagtt tataaagctc tttgtcatcc 32580 

aggccccgtt ggagtctcac gcagacctgg tcgcaggcgg ggctggtctt gcctgtccca 32640 

gctgcatgga tggggaactt gaggcttgca aaggttaagg ggctgttcga ggcccaggct 32700 

ggcaggagat gggcctgggc cagagtctgg gacttcccat gcctgggctg tctttggtcc 32760 

tgttgctcac catccctccc tggggccatg accttagaga gccaaatgga ggtgcaggta 32820 

acccacggca aggaggggtt gccatgactc agagtccccg tcctgtggcc ggcagtacct 32880 

ggtgcaacga cttggatttc agaccagcca ctgtagcccg ctgacggtgc gctcgaagtg 32940 

ccacagcttc tgaagccagg caggactcag gccaggagac tctgttagct gttgagaggg 33000 

agaggccaac ggatgttctg gttctgctag agagctggtt cttcggatcc tggtaccagt 33060 

gcactgagag gaggcccagc ttgattctgg ggctgccttg tggtggcatg tgctgctcac 33120 

tgacaccctc gaggagtgtc ttctctcggg cttgttgact gtgcccggtt ttccgcagtt 33180

cactggtgca cacataggca catagcaaac cgcacacaca gtcgtgggta tgagtttcac 33240 

tacattccac caccagtgtt cactaccatt acctgccttc cgtcttaagt gttcatcatt 33300 

taaaaataaa tttattgggc tggacgcggt ggctcatgac tgttatccca gcactttggg 33360

aggctgaggc gggcagatca cctgaggtca ggagttcaag accagcctgg ccaatatggt 33420 

gaaactccat ctctactaaa aatacaaaat tagctgggca tggtggggca tgcctataat 33480

cccagctact caggaggctg aggcaggaga atggcgtgaa cccgagaggc agagcttaca 33540 

gtgagcccag atagcaccac tgcagtccag cgtgggcaac agtgcgagac tccatctcaa 33600 

aaaaaaaata aataaataaa agaaaaataa atttatgatc tatttcaaaa ataacacatg 33660 

tactttgaaa cagcagagac acatatgaca cggagaatga aattccccat agcgcacccc 33720 

caagagacag ccctggtccc cccgtctttc ccgtggacct ccagcggggc agatgctgag 33780 

ccgcctgttg tcgagtggcg tgctatcccg tcctccagct cctctgtggc ttacagacac 33840

ccacctgcag ccctgtcttt gcctcctcta gcgcccacca ccttcttgct gttcagccag 33900 

aaatctgcca tcagtcggat gatcccggac gaccagcaca gcccggatct catcctgccc 33960

ctgcatggac tgaggaacgt caaagccatc gactatgacc cactggacaa gttcatctac 34020 

tgggtggatg ggcgccagaa catcaagcga gccaaggacg acgggaccca ggcaggtgcc 34080 

ctgtgggaag ggtgcggggt gtgcttccca aggcgctcct cttgctggtt tccaggctgc 34140 

tgcccctgtc cttagcagag ggaggaaaca gaggatggct ctgggtgaat gatgacttgg 34200 

gcttcgatta tgtagtcaca gggtatgacc ctgagatgcg tggaaccccg agactgtgat 34260 

tatatgtaga aactgggttt ccccgttgtt taagtagtca tggtggggtc agaccccaca 34320

ggacttttgt cttttcaaga aagaaaatgg tcgtgtgtca tgcaggggta gttggtactg 34380 

gttaatccag gtttatcctt tattttgtgg gaactgtaca gtcatttctg ctacaatgct 34440

gtatatgctc ttctgaaaga cacctatgca aaatcgcaca gtaaaaatga cacaactcat 34500

agggaaagcg gggccagggc acagccctca aaatctccat caatgacatg taagaaaaga 34560 

gaggaacctg ggaaatagca aagtgccttt tgcacattaa atggttagct atatcccaca 34620

atactgtgca ttcgtaaacg ttaatgctgc aataaatacg gcacttcacc ttgggaagat 34680

ctggagttgg cttatgagtg tggaagggtg tagcgcatga gtttttgtga aacactggaa 34740

ggaggattgt gggaaatcaa atggaaagtt ctcaccccag gcgtggagaa gagtgggtca 34800

tggccccagc agtgagccca gggaggtcag agacggaggt gtgtgtgtgg gtgtgaccct 34860 

gcgcagttcc ctgccggctg tagttttttg cattcgctta atgtttctcg tggaggaaat 34920 

tgtgcatgag caaatgtgaa accgtgctgt gctcaaattg tcctaataca tcattgcatt 34980 

ggaacagatt ggcttttttt tttttttttt tttttttttt tttttgagat ggagtctcac 35040 

tctgtcacca gcctggagtg cagtggcatg atcttggctc actgcaacct ttgcctccta 35100 

tgttcaagtg attttcctgc ctcagcctcc tgagtaactg ggattacagg catgagccac 35160 

cgcggccggc cagatttgca tttttgaaac aactgctagg ctgggcgcgg tggctcacac 35220

ctgtaatccc agcactgtgg gaggccgagg caggtggatc acctgaggtc aggggttcga 35280 

gaccagcctg gccaacatgg tgaaaccccg tctctactga atatacaaaa atcagctggg 35340

tgtggtggcg ggtgcctgta atcccagcta ctcaggaggc tgaggcagga gaattgcttg 35400 

aacccaggag gcagaggttg cggtgagccg agatcacacc attgcactcc agcctgggca 35460
acaagagcaa aactccatct caaaaaataa aaaatagaaa aacaagtgct gtagcggaag 35520

tgagcacttt gcggagtcag gcttgtgtgg cctgttccac aaatgatgtg ctcacggtgg 35580

cctcaggccc acctggagtc tgcagcatgg ggcacaacag gttcattagt gtagaattcc 35640

aggacaggcc tggctcctaa gcagccttct tttacaaaaa ctgcagagcc cgcctgtatc 35700

ctagcacttt gggaggccga agtgggtgga tcacgaggtc aggagttcaa gaccagcctg 35760

gccaacatgg tgaaacccca tctctactaa atatacgaaa attagctggg tgtggtggca 35820

cgcgcctgta gtcccagcta ctcgggaggc tgaggcagaa ttgcttgaac ctgggaggtg 35880

gaggttgcag ggatctgaga ccatgtcatt gcactccagc ctgggcaaca gagcgagacg 35940

ccatctcaaa aaaaaaaaac ctacagagcc acacggcctc tttctccacc gagtgttggt 36000

gtgggagctt gtgttattgt ggtgaaatct tggtactttc ttgaggcaga gagaggctga 36060

gcgcctggag agactttcac atgggtcgcc atgtccgccg tcggtttcgc tgttgtgctc 36120

cccatctgaa ggctggtgcc gtccagacag gctggacgcc cctttccacc agatccttcc 36180

tcccgcagca gtttctagtt acgttgtact gtgaggtctg tgtccttggt tgatggcaaa 36240

agtcagccga attgaaattc agagccatgc ctggctccct ggagcttctc tcctgggcag 36300

ctgtgatcat tgcctctgct gtggtgtggg tggtggaaat ggattccttt catcttgctt 36360

gctacaggtg actgtcacgt ggagtccttt ggagagaggg acgtgttaat tgatggatgt 36420

ggctcccatg ctgagaaagc tcctgggcgt acattgcctt agagtttcat tggagctgcg 36480

ttcttttatg gtgtctgcta ggcagaagtg atgaagactt ggaagaaaac ccagaaggtt 36540

ttccacttaa tttggaaaat gtgcttttcc cctcctgtgt cttttgctaa ggtccagcct 36600

cctgcagcct ccccgctctg tggactctgg ctttgattct ttattaggag tccccctgct 36660

cccccaaaag atggtgtcta aattatcatc caattggccg aggttttgtt ttctattaat 36720

tgtttttatt ttttattgtg gtaaatttat ataacataaa atttgccatt ttaattgttt 36780

tgttattgtt gtttttgaga cagggtctca ccccagtgcc caggctggag tgcagtggtg 36840

cgatcatggc tcactgcagc ctcagcctcc agggctccag tgatcctctc acctcagcct 36900

ctctagtagc cgggactaca ggcatacact accacatctg gctgattttt tgtatttttt 36960

ttttattgta gagacccgct atgttgccca ggctggtctc aactcctgga ctcaagccat 37020

cctcccacct caccctccca aagtgctggg attacaggca tgagccacaa cacccagcca 37080

ttttaatttt tttttttttt tttgagatgg agtctcactc tatcgcccag gctggagtgc 37140

agtggcgtgg tatcaactca ctgcaacctc tgcctcccag gttcaagcga ctctcctgcc 37200

tcagcctcct cccgagtagc tgggattaca ggtgcccatc actatgcctg gctaattttt 37260

gtatttttta gcagagacgg ggtttcacca tgttggccag gctggtcttg aactcctaac 37320

ctggtgatcc gcccgcctcg gcctcccaaa atgctgagat tacaggtgtg agccaccgtg 37380

cccggccttt ttttgttttt gagacagggt cttgccctgt cacccagact ggagtgcaat 37440

ggtgggctct tggctcactg cagcctccgc ctcccaggct caagttgtgc acctccacac 37500

ctggctaact gtattttatg tagagacaga tttcaccatg ttgcccaggc tgggcttgaa 37560

atggactcaa gcagtccacc cacctcagcc tcccaaagtg ctgagattac aggcgcgagc 37620

caccgcaccc agcccatttt acctattctg cagttgacag ttcagtggca ttcagtcagt 37680

tcacgaggta accatcactg ccattcatct ccagactact tcaccttctc ggcagatgtc 37740

cgaaactgtc cgcattgaac acactcctca tctccctctg acagccacca ttctactttg 37800

tatctctctc tgccttctct aggtacctca tgtaagtgga attataccaa tatttgccct 37860

tgtgtgactg gcttctttca tgtgacatgg tgtcctcaag gttcatctgt gttatagcct 37920

gtgtcagaat ttccttcctt aaagcctgaa taataacccg ttgtaaaggc tgggcgcggt 37980

ggctcacacc ctctaatccc agcattttgg gagtccgagg tgggcagatc acttgaggtc 38040

aggagtttga gaccagcctg gccaacatag tgaaaccctg gctctactaa aagtacaaaa 38100

ttagctgggt gtggtggcgc gcacctgtaa tcccagttac tcaggaggct gaggcaggag 38160

aatcgcttgt acccgggagg cagaggttgc agtgaaccaa gattgtgcct ctgcagtcca 38220

gcctgggtaa cagagtgaga cttcctgtct caaaaaaaaa aaaaatcatc ggatggatgg 38280

acggaccact tcttgttatt tatccatcca cgggtgctag gtttcttcca cctttggttg 38340

tcgtgaataa ggccactatg aacatttcct tccgtggtga aggttttgta ctagtgagga 38400

aaaggcgtgt ttgtggtgtt gcataggatt ctggtaagaa agtttgcact aaccataagt 38460

atttgtacta cattaaaatg aaagctcagg ggccgggcgc ggtggctcac gcctgtaatc 38520

ccagcacttt gggaggccag ggcgggcgga tcatgaggtc aggagatcaa gaccatcctg 38580

gccaacatgg tgaaaccccg tctctactaa aaataccaaa aaactagcca ggtgtggtgg 38640

cgggcacctg tagtcccagc tacttgggag gctgaggcag gagaatggcg tgaacccggg 38700






aggcggagct tgcggtgagc cgagatcgct tcactgcact cgagcctggg caacagagca 38760

agactccgtc tcacgcaaaa ctctgtctca cgcaagactc cgtctcaaaa aaaaaaagag 38820 

ttcagggttt atgaaactgg ccagccgcgt aaagtttgct gtgttgtttt tgtgcccggg 38880 

aggagtgtgg ccagggtgtc acgtcacaca gtacacgttt ctcagatggt ggttctccag 38940

actgctgtcc caaagtctgt ttttgcatct ggttcccaca gacccaccct ccacggtgag 39000

cctgattttg gccagggtag ctggaatctt gcttgtcttt cagcccggca gctgtaccag 39060 

tccagggtcc acagctagtg gcttttagga aggaatttgt tcagttggct ttgacacatg 39120

gccccctagg gtccacagct ctgtagtgat gtggatgttg ttatctacaa agacacatga 39180

tccttcgtgt ccagatgaaa gtgatgatgt ctttgcagct gcccagcaag gctgtgtgtg 39240

tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tggtgtgtgt gtggtgtgtg tgtgtgtatg 39300 

ggggagggag gcaccctttc catctggggg tgtgtgtgtg tggggtgtgt gtgtgtgtgt 39360 

gcgcgtgtgt gtggtgtgtg gtgtgtgtgt gtgtatgggg gaggcaccct ttccatctgg 39420

gtccaagaga ctgggcctgg ggaagacgct tctttttatc tacttagaga ctttgtttta 39480 

tttgtatttt tttgagacag ggtctcactc tgtcacccag gctggggtat ggtgatatga 39540

gcatagctca ctgcagcctc ggcctcccag gctgaagcga tcctcccacc tcagccttct 39600 

gaatagctgg gactgtaggc gtgcgtcacc atactgagct attgtttttt ttgtttggtt 39660 

ggtttaattt tttttgatac agatggagtc ttgctatgtt gcccagacta gtctcaaact 39720

cctgaactca agtgattctc ccacctcagt ttcccgacat tctgggatca caggtgtgag 39780 

ccactgctgt ctccctgttt tattaactgc tgaaagacct agataaagaa agtctgaaaa 39840 

gacttactat cagagcacca tcctaagatg attccctctg actcaatgga gagggagggg 39900 

agcttttcct tcaggcctgg gtggcaggag cccaggtgct ccaggcccca tttgccccag 39960

gccaaatcac tcgggaactt ggatgcagct gtctttcagg gtaacccaaa ggaaccagat 40020 

ccccgcaggc agtaggcttc tgggctgtcc tctcctccta cgtcagctca gtaagagccc 40080

ttcgaaggga tgctgtgtcg gaggccccaa aagcccaggc tcatccctga gatgcacagg 40140 

gtgggctggg cttaggcagc gctcgagcat ctcctggacg gtgaccccag agagtgtgga 40200

gacggagagt ccttgagagt cactgagaga cgtggctgcc ctgccttccc aagaggggct 40260

ctgagtcatt ccccacactc acctgcccct acccaccctc acctggcccc cagcctcacc 40320

tacccccaca tctgtaccga tccctttacc cgcaccttcc ctacccaccc tcacctcccc 40380 

tgtaccttca cctcccccac tcacccgccc ctgcaccctc acctgtcccc caccttcacc 40440 

taacccccac cctcacctgc cctcccctca cctggcctcc ttccgttggg gaaggggttg 40500 

taaggggcgg cccccaaact gtctgtcctg gtgccctgca gagaaaacag tacgtgaggg 40560 

ccgcagtcca aaagcttgag tcctggaagg tggaggagac agggatgtgt tgggaagggc 40620

cccatggtct tggatccctt ctcgactgtc aatggggcct tcatgggagc gccagtctag 40680

tgatgcacag ctgggtgccc ggcgggtggc tgaggaggcc taaagtccga ggcggcaaga 40740

gctcttccag aggctgttgt cctaatcgct ctggcatact caggcgggca cgtagttagg 40800

agctgattgg agaggagaga cccccacacc aatactggga tttgactttc aggctaaact 40860

tgagaagtgt ggcctctgct gtcctgccag agctctccag ccagtgccca gggctctcca 40920

gccagtgccc gggggtctcc accagtgccc gggggtctcc gccagtgcca ggggtctccg 40980

ccagtgccca ggggtctccg ccagtgctca ggagtcttgg tttctttgtc ttacagccct 41040

ttgttttgac ctctctgagc caaggccaaa acccagacag gcagccccac gacctcagca 41100 

tcgacatcta cagccggaca ctgttctgga cgtgcgaggc caccaatacc atcaacgtcc 41160

acaggctgag cggggaagcc atgggggtgg tgctgcgtgg ggaccgcgac aagcccaggg 41220

ccatcgtcgt caacgcggag cgagggtagg aggccaacgg gtgggtgggg gtgctgcccg 41280

tccaggcgtg cccgccgtgt cttatgccga atgccagcct ctcacaggct ggggagactt 41340

tccacctggg gatccaatgg gtggctttcc agggtcccaa aagcaaacac aggtttttca 41400 

cagcccgtcc gggaaagcag aaagccccaa ggggctggaa ggggaaaggg ggagctctgc 41460 

tgagaggtta caaggcagcg ctggccgacg ggagttgcag ttgataggtt ttgtatcatc 41520

cttgttaaac ttgaaccctg tgcagaaatc ccttccacgg catgggggct gcctgttgac 41580

tcgctcctgt tccaccacag ggagctcctg ggcttcttcc tcccagaggc ccccgacgct 41640

cccacctgtt ggtcgtcaga gcttctggtt ggtgggaagg cacccaggac cttgaggtct 41700

ccagagagaa aagccaggga aagagggaga ccgaaaccca tgtgacatga aactcaggct 41760 

ccaaactgag cacgggaacg tttggggaca ggagcgcgat ggccttcctc agatagctgg 41820

ggggctggca tgaagacggg agctacagcc agcacaggtc ctgggccggg agcccagaga 41880

ttgagccctg actctgtcac ttactggcca cgtgaccttg ggcgggtggc atagcctctt 41940
ggagactcag tttcctcatt ggtaggagtg acggccacag tggtgcggcc tctgcagcac 42000

acggggggct cggtgggcgg aagccccggg tctataaggc ggctgtgcag gagccagccg 42060 

agctggtctc ccaacagcca gggctccggg gtccttagca gctgtggggg gcctgcacct 42120 

gtttcccatg gctgctgtca gaaattacca gaagccaggt ggctgagagt aatggacact 42180 

tgttctctca cagttcctga gggctgaagc ccgagatcga ggtgtgggca gggccctgcg 42240 

ccctctgaag gctctgaggg aacctttggg cttctggtgg ctccaggcac cccttgactt 42300 

gtggtcctgt cactccagtc tctctgtctg gctgcacatg gcgtggcctc ttctgtacca 42360 

ttgaaggaca cttcagttgg atttagggcc taccctcacc cattgtggtc gtatcttgat 42420 

ccttcatgac atttgtaaag accctgcttc caaataagct cacattctga ggttctgggg 42480 

tgagcgggaa tttggagagc attgttcaac tagtatagaa tgtgacctgt cagcctcggg 42540 

cagccctgag aggcaggggc tttccacagc ccagctgggt gccctgggct ccgtgctgtc 42600 

cgaggagacg ccatccccac acccgtcctt cacccgccac cctcccgcag gtacctgtac 42660 

ttcaccaaca tgcaggaccg ggcagccaag atcgaacgcg cagccctgga cggcaccgag 42720 

cgcgaggtcc tcttcaccac cggcctcatc cgccctgtgg ccctggtggt agacaacaca 42780 

ctgggcaagc tgttctgggt ggacgcggac ctgaagcgca ttgagagctg tgacctgtca 42840 

ggtacgcgcc ccggggcctg ccctaaccgc agacacccgg ccttcattgt cagtaatggc 42900 

agcagctgcc acattgtccg agacctgccg tgagcccagt gccgcgccag gggctttgtg 42960 

tgtagcgtgt tttgtcctca cactgacagc tgtaggctgg ggttctgagt gagccccaca 43020 

gggcagaggc agaaaatgag tctcagagag ggtgagcgag ctgcttgggg ccccacagca 43080 

ggagatggag caggactgca gcctagcctc tgcccccagc acctgcgcaa gaagctgctc 43140 

tgctctggac tgtgttaggc tgcgagggct ggagagaaat gagagttggt gcttagagag 43200

ggggcgcagg tccccatggc ttttcctctt atgatgaggt agatgggtga agggaggggc 43260 

catgcttgca ggggccagtg accgaggccc gccgttggaa ctgatggcct tcatcccgag 43320 

cccagcccag gtgggagcag ggctttccga gggcttgtct tgggtcggcc tgcttccagg 43380 

gactctgctg cagctcccac ccctgtccaa agcatggaat cccccaggct ccctggcagt 43440 

cctgtcaacc tctgtcctcc caagctgagt gtggggcaag ttctggaggt cagcactgct 43500

caggggggcc cacgggctgc ttgcaggggc caaccgcctg accctggagg acgccaacat 43560 

cgtgcagcct ctgggcctga ccatccttgg caagcatctc tactggatcg accgccagca 43620 

gcagatgatc gagcgtgtgg agaagaccac cggggacaag cggactcgca tccagggccg 43680 

tgtcgcccac ctcactggca tccatgcagt ggaggaagtc agcctggagg agttctgtac 43740 

gtgggggctg gcagtggggt gggcagggtg gcctctaaac ccgacccctg gaggaggctg 43800 

gaggccagtg caagatcctg tgtggcctca gccaggcggt ggtctctgcc agatgccaac 43860 

tgttgcccgc tggggttcag cgacatgtcc gaatgtcccg aggcctctga ggttgttttc 43920

ttttgccgca gaacaaatca ccacgaacag cgttttaaga caacaccaac tctttttttt 43980 

tttttttttt tgagtcagga tcttgctctg ttgcccaggc tggggtgccc tggtgcaaac 44040 

acagttcact gcagcctcga cctctgggct taattaagtg aacaccttgc ctcagcctcc 44100 

caggtagctg ggactacagg tgggcaccac cacacctggc taattttttt ttgtagagac 44160 

ggggtttccc catgttgccc aggctggtct gcaactcctg ggcacaagct atctgcctgc 44220

tgtggcctcc caaagtgcta ggattatagg tgtgagccac tggcctgaca acacccacgg 44280

attgtctctc agttctgtaa ggcaaagtcc aggcacagcg tggctcacct gggttctctg 44340 

ctcagggtct cacggggcca gaatcaaggt gtcaggaacg ctgggccctc agcggaggct 44400 

ctgtggagaa attagcttcc ttgctcactc agcaggtagc agttgtggga tcgaggttct 44460

gttttctctc tggttattgg tcggggacca ctctcagctc ctagaggcca ccacaggtcc 44520

ttgccccgtg gccctctctg cctcagcagt gggggctccc tgcgtcagtc cctcccacac 44580 

cttgagtctc tctgatttgc ttctaaaggg ccctgtgatt cggctcagcc acctttagat 44640 

taggttagcc tcccctttga tagactccaa gtcggctgat taataacctt aatcacatct 44700 

gcagaatccc ttctgccaca taaggtcatg acgccgtgct ggggactggg gtgggaaatt 44760 

acggggtcat ttaggattct gcctgccact gccttgctgt gtcccagggc ttgggggagg 44820 

ggcctccaca gctgggacca cagtccttcc tcccctccat ggtaaccatc tgaggattac 44880 

ttgagaccag cctgggcaac atggtgagaa cccatcccta caaaaaatac aaacaaaaag 44940

ggaccaggct gggcttggtg gctcatgcct ataatcccag cactttggga gaccaaggtg 45000 

ggctgatcac ttgaggttgg gagttcgaga ccagcctgcc caacatagtg aaatcccgtc 45060 

tctactaaaa atacaaaaat tagctgggtg tggtggcagg cgcctgtatt cccagctact 45120 

ggggaggctg aggtgggaga attacttgaa cctgggaggc ggaagttgca gtgagccaaa 45180
attacgccac tgcactccag cctaggcaat agagtgagac tccgtctcaa aaaaaaaaaa 45240

gggccagggg tggtagtgac aaagagaccc tatcccaaaa aaaccgaaca ctgaatcctt 45300 

gagactgagt aaggacactg tgaaattttt ctgggtgggg cagggaacag agcgtcttct 45360 

gtcatttctt ccacctgggt gtggtcagct ctccctccaa gctgcctcct cttcttctca 45420 

ttgtccgggt gttggacaca tttggttaac tggatagaat aacgcgagtt cccagggact 45480

tggtccattt gctattttat tttattttat tttattttat tttatttatt tatttattta 45540 

tttatttatt tattgagatg gagtttcgtt tttgtcgccc aggctggagt gcagtggcgc 45600 

gatctcggtt cactgcaacc tctgcctccc aggttcaagt gattctccta cctcagcctt 45660 

ccaagtaact gggattacag gcacccacca ccataccagg ctaatttttt tgtattttta 45720 

gtagagacgg gttttcgcca ttttgcccag gctggtcttc aactcctagc ctcaggtgat 45780

ccacgcacct cggcctccca aagtgctggg attacaggca tgagccacca cgcctggcac 45840

catttgctat tttaattccc atgtgtatta gtgtcccacg gctgctgtaa caaatgacca 45900 

caaactggat ggcttaaagc aacagaaatg gattccccca atgtgctgga gaccagaagc 45960 

ctgcgaccaa actgttggga gggctgtgct tcctctgggg gctccaggga ggatctattt 46020

gttggccctt ccagtgctgt gggtgccagc gttccacact tgtggatgcg ccgcctcaac 46080 

ctctgcccat cttcatgtgt ccatctcctt tgtgtctgcg tctttacctc ttcttcttgt 46140 

ctgtgttgcc tcttataagg acgtttgtca ttgggtttag ggcccaccca aatcatccga 46200 

gatgacctcg tcttgagatc cttaacctgc aaagaccctt tttccaaaaa aaggttatgc 46260 

tcacagattc taggccttaa gacatgggtg tatctttctg gggggcacta tccaacccct 46320 

tatacaatga aagacgggaa gagggccagg tgtggtagtt cacgcctgta atctcagcac 46380 

tttaggaagc tgaagcggga ggatcacttg agcccaggag tttacaagta gctaggcaac 46440 

atgatgagac cccatttcta caaaaagtga aaaaaaaaaa aaaaaaaaaa aagccaggtg 46500 

tggtggctca cacctgtaat cccagcactt tgggaggctg aggcaggcag atcacgaggt 46560 

caggagattg agaccatcct ggctaacacg gtgaaacccc gtctctacta aaaatacaaa 46620 

aaattatggc cgggcgcagt ggctcccgcc tgtaatccca gcactttggg aggccgaggt 46680 

gggtgaatta caaggtcaag agatcgagac catcttggct aacacggtga aaccccatca 46740

agatcacaag gtcaagagat ggagaccatc ctggctaaca cggtgaaacc ccgtctctac 46800

taaaaataca aaaaattagc cgggcatggt agcgggcgcc tgtagtccca gctgctcggg 46860 

aggctgaggc aggagaatgg cgtgaacccg ggaggcggag cttgcggtga gccgagatcg 46920 

ctccatgcca ctgcactcca gcctgggtga cagagtgaga ctccgtctca aaaaaaaaaa 46980 

aaaaaaaaaa aaaaaaagaa aattagccag gcacagtggc aggtgcctat tgtcccagct 47040 

acttgggagg ctaaggcagg agaatggcat gaacccggga ggtggagttt gcagtgagcc 47100 

gagatcatgc cactgcgctc cagcctgggc gatagagcaa gactctgtct caaaaaaaaa 47160 

agccaggcat ggtggtgcat gcctgtagtc ccagctactc aagaggctga ggcaggaggg 47220

ttgttcgacc cacggagatc aaggctacag tgagccatga tcgcaccact gccctccagc 47280

ctgggtgaca gagtgtgacc ctgtctcaaa gtaagtaaat aggaggagag acaagtgggc 47340

agttcagact gatggtatgg gcacagtaga gactggtgca gacaggctgg cctgtgatgt 47400 

caagcaactt ctgtaattgt ttccggcatc catttgtgtg tcaatttccg tgtcagtagg 47460 

aagactctgt aggctgccaa gaggaataag tgggaggatc ctcccagaga ggccgggcct 47520 

gcaggagggc cagttctcat gagttctcat ttggccccta ccctccaggc tgtggttctg 47580 

aggtgggaga cagagcctga cctctgtttg tcttgttttg tctttgcagc agcccaccca 47640 

tgtgcccgtg acaatggtgg ctgctcccac atctgtattg ccaagggtga tgggacacca 47700 

cggtgctcat gcccagtcca cctcgtgctc ctgcagaacc tgctgacctg tggaggtagg 47760 

tgtgacctag gtgctccttt ggggtgatgg acaggtacct gattctctgc ctgctaggct 47820 

gctgcctggc atccttttaa aatcacagtc cctgtggcat ccagtttcca aagctgattg 47880 

tgtcttcctt tgccctcctt tcttttctac tatgtgcatt cggtgctatg aattttcctc 47940 

taagtactgc gtttcctgca tctcacaaat tttgttacat tttcattttc aggtagtttg 48000 

aatattttta cacttctcct gagatgacat ctttggctca tgtgttattt agaagtgttg 48060 

cttagtttct aaagagttgg ggcttttcca gctgtctctc tgcaactgat ttctaattta 48120

attctactgt agtctgagag cttattttat atgatttctg ttattttaaa tgtgttgggt 48180 

gtggtgtttt tgttgttatt gtttttgtgt ctttttgttt tgttttgctt cgtttgtttt 48240 

gtttttgaga cagtgtcttg ctctgtcact caggctggag tgcaatggcg cgatctcagc 48300 

tcaccgcaac ctctgcctcc cgggttcaag tgatcctctt gcctcagcct cctgagtagc 48360 

tgggattaca ggtgcacgcc accataccca gctaattttt gtatttttag tagagacggg 48420






gtttcaccat gttggtcagg ctggtctcga 
cctcccaaag tgctgggatt ataggcgtga 
atcacccagc atcatgcagt ttatcttggt 
ggattctgct gttgttgggt ggagtgttcc 
agtgctcatc aggttgtctc tatccttcct 
tgacaggggt gtggagtctc caactctaat
tatctttttc tctccttcta cccttgatcc 
tggtgggaga gtggggtagt gaagaacctg 
atcctggctc tgtcacttcc cagttgagtg 
ctccacttcc tcctctgtga aattgagagc 
gtaacagggc actccacctg gaccctgaca 
catgggtgtg gcagtagtaa taaagtgacc 
cagggctttc tctcctatgc ccccatgcct 
gctctgctca gcgacctttc tcccgtggga 
cagcagcaca cactggctgt gcaccctttt 
tctcgctttt ttcgcgcagg ctgaagtgca 
acctcctggg ttcaagtgat tttcctgcct 
cccaccacca cgcccggcta atttttgtat 
gccaggatgg tcttgaactc ctgacctcag 
cagggattac aggcgtgagc caccacaccc 
tgttcctgga gagactggct cctgcccagg 
ctcacacctc gagtttggcc gtaagcagag 
cttcccagca gcttttctcc aagttacagc 
ggtgtctgtg cacctcccac tgatgcccga 
aacgctgcac taactgcagc cttgggcctc 
accccctggc ctgggccctg ggccctcctg 
ctcgagagac gtgtatttac cgcacaggtg 
ggagatcaca gaaggacagg gcctcactga 
aggagccagg ctgggcaaga ggcggccaca 
ttcagccctt ccctgagccg ggcgcgcccc 
tcacaggagc ttgtggtgcc cagcggctgc 
999 a 99 ct 9 a gaggctgctc ggtttcacaa 
gtatgcccca tgtcgccctc aacctccagc 
cacggcttgc agacatctgt cgttggctcc 
tcctcctgtg gctgagccca gtggggacag 
tctttgatca ggccccctcc cctatgccac 
gtcatgggga aggcctgcgg gttccaaaca 
gaaaccgatg ttttttacct tgatcagatt 
agtgaggcct gggccgattt cccagcatcc 
ttttctgcac aaagtgggag ggaggagtcc 
gcctcctctg gcttctctgt gccagtgccc 
gggactctgc tattccctga gtgctgccgc 
ccgaggggca ggatggcctc catggtcaca 
cctctccctc ctccctttca gcgccctccc 
tttgtcccgt cttctgcagg cgcctgggac 
ttgctatgcg cacgctggtc accacagagg 
ccgcaacagg tgtggctgct gaccacctgc 
agtgggcact caggcgtggc tgagcagatg 
agtcagggct gaatttccca cccggagcat 
ttaaagctct gattcccagt tggcaccctt 
atgtcttctt gcagtgaatg tgtattctcc 
ggagcggcac acaggagccg cagcgatgga 
gggatgccaa acccgcgctg agtccctctc 
tgcctgggtt ttgctgggcg gggctgcgtg 



actcctgacc tcgtgatccg cccacctcgg 48480 

gccactgtgc ctggccatta ggtgtgtttt 48540 

gaatgttctg tgtactcttg aaaagaatgt 48600 

agaaacatca attagatcca gttggttaat 48660 

tcctgactgc ctgcttgagc tgtcagttat 48720 

ggtggatttg tttatttctc ctagtagttc 48780 

tcttctcccc ctagggcttc ctggtgttag 48840 

gactttaggg ccaaagaggc cagggttcaa 48900 

accctggctg gtgcctgaat ctctgtgagc 48960 

acttacctgg caggctgtca tgggcatcaa 49020 

cgtgatgcac aggaatgcca gctgctatgc 49080 

atctgtatcc tcaccacagt gaagcctgtc 49140 

ccaggtggcc ttggatcctg ttggttctgt 492 00 

gttcctgggg gttcagcttc atcctacaga 49260 

tttttttttt tttttttttt tgagatggag 49320 

gtggtgtgat cttggctcac tgcaacctct 49380 

caccctccca. agtagctggg attacaggct 49440 

tttcagtaga gatggtgttt caccatgttg 49500 

gtgatccgcc cacctcagcc tcccaaagtg 49560 

ggagtgccgg ttgtttttag cagtttgtct 49620 

agctcgggga gtagggccgc ggggtgctgc 49680 

gggacatttt gtgactgtcc ccctcctgag 49740 

ccaaaagctc aggtggattt gcaacccaac 49800 

actgccctgg ccaagaaacg gggccgtcag 49860 

catgccagag gccatgccct tccatccacc 49920 

gctcgggaac tccaggcccc ttcctcacgg 49980 

cttgtcattc tcttgtggcc tcttctccag 50040 

ggtctcggac atggaccctt tgatagtggc 50100 

gtcacctcag cagtgccatc accaccgcca 50160 

tggctctggc cccagtgtcc cagttacagc 50220 

ttctgattga gagtcgaggt cggaggcttt 50280 

ctgctgaggg agacttgggc tccatctcag 50340 

caccggtcct ccgtgtcccc catggccagg 50400 

tctcagccgt cgtgggctga ccctggcacg 50460 

ctgcttcctt ttattaccct agaactctcg 5052 0 

acagtccctg tcactcgggt gagcccagta 50580 

tccaaaggct tgcgtgcagc atgacagctt 5064 0 

tcagcttggc gggggctttg ctcagctttc 50700 

cctcctgagg ccagcctctg tttcctgtga 50760 

taggaaatgg ggggccacct cgaagcctag 5082 0 

ccacgctttg tgtctgtgtc cccagcccat 50880 

atgcccagcc cgcactgagg acgtggagcc 50940 

cgtaggaagt ggcctccacc ctccgatgat 51000 

cgggggtgtc ctcagccctc ctgcctgtgc 51060 

gtgctgacag gtcctctgcc ggctcctgcc 51120 

cctggccctt cttctgtagc agtcccacac 51180 

tttctgcccc tctggtcctg aggagggcgc 5124 0 

tgtgttgccg ggaggaggaa ggactgctcc 513 00 

ttctgctgta tttggtgtag cgcctgctgc 51360 

tcccttctgc attgaaaaac atacggatgc 5142 0 

cagcctctct tctgggttgg ggctggaggt 51480 

ggatgtgcgg gtgcagcacc ccgtacagca 51540 

aacttctgct ttgaagccca gtcacgccat 51600 

tgatgttctc ctctgtccct cccccagagc 51660 
cgcccacctg ctccccggac cagtttgcat gtgccacagg ggagatcgac tgtatccccg 51720

gggcctggcg ctgtgacggc tttcccgagt gcgatgacca gagcgacgag gagggctgcc 51780 

ccgtgtgctc cgccgcccag ttcccctgcg cgcggggtca gtgtgtggac ctgcgcctgc 51840 

gctgcgacgg cgaggcagac tgtcaggacc gctcagacga ggcggactgt gacggtgagg 51900 

ccctccccgt caaggctctg ccaagaccct ggccctgccc tccgggatac gagcttgggg 51960 

ctgcctccgg cctcacagga gtaggggctc tgaaaacctt tgcttgcagg gagattgcca 52020 

agtctgtctt ttaggcccaa caaggaaaac tctgcagttc cacccatcct gtcccaccag 52080 

gtagtgtggc ttgaaggcag actgtgaggg tctatctcac cttcctgcat taggtcagga 52140 

gtttcacaga aacctgaggc acattcaggg gtgggctgca gaggtccatg gctcacaccc 52200 

tggaaaatcc gcccccaaaa gacagtgctg tctccactga ccagtctgtg ggatagtgct 52260 

taagcctgag tggtttctat caacatgtag aatcaggagg tataaagaga tttgctcagg 52320 

catcctgggc cctctctgac cagcaggatc ttcctttaga tcttgacagt gaaacacatc 52380 

tcttctgtgc cccctgtgag ttttctttca ttcattcatt cattcattca ttcattcatt 52440 

cattcattcg agacagagtc ttgctctgtc acccaggctg gagtgccctg gtgtaatctc 52500 

ggctcactgc aacctctgcc tccagggttc aatcgattct cctgcctcag cctcccgagt 52560 

agctgggatg acaggtgcgc accaccatgc ctggctaatt tttgtatttt tagtagagac 52620

agggtttcac catgttggcc aggctggtct cgaactcctg acctcaggtg atccgcccgc 52680 

ctcagcctcc caaagtgctg ggattacagg catgagccac cgcgcccggc ctgagttttc 52740 

cttttatgaa ggacctgctt ggttggttgc ctgccacatg ttgtcagcac catgggccca 52800 

ggactgctga ggagctgttg atgccctcgc tctcccagag ccaccggctc tgttagataa 52860

ttcacatgca gtctggccac tgtcctacgt cctcattcac aaagagcaga catttcgtag 52920

aagatgaggg cctgggagta acctccctgc atgtttttct ataaaggcat agtggttaag 52980

tccttccagc tcattgacca ttggagaatt ttatggaggc tgtagactag gggctggtaa 53040

actaagggcc caggggccaa atccagcctg ccacctactt ttgtaaataa agttttcttg 53100 

gtgcacagcc atgcccattc attcatttgc acaatgtctg tggctgcttt catgccaaaa 53160 

gcaagagaac tgagtggtta tgctggagac ctacggcctt caaagcccca gacctcacgt 53220 

ctggcccttg acagacagag cttccccagc cctgctgcgc atcctggccc agcatgtgct 53280 

gtgtgtgtga tttcagcttg caggagccgt ggttaggaat tgtccctgtg ttggtccatt 53340 

ttgcattgct atgaaggagc acctgaggcc gggtagatta tgaaggaaag aggtctgtct 53400

ggctcatggt tctgtaggca gcaccagtat ggcacccgca tctgctcagc ttctagtgag 53460 

gtctcaggaa gctttgactc atggtgaaag tcgaagcggg agcaggtgca tcacatggtg 53520 

agagagggag caacggagag agagagagag cgcctctccc tcttgccctc accttgagag 53580 

gagatgccag gctcctttaa gtaaccagct cccatgtgaa ctcacagtga gagcccattt 53640

gctactgcgg agagggcacc aggcatctgc tcccatgacc caaacactgc ccaccaggcc 53700 

ctacctccaa ccttggggtc atattttatt ctgttctatg ctatgctatg ctatgccatg 53760 

ccatgccatg ccatgctatt cctattctat tatttgagac agaatctcgc tctgttgccc 53820

aggctggagt gcagtggcat gatcttggct cactgcaacc tccacctccc aggttcaagc 53880

gattctcctg tatcagcctc ccgagtagct gggattacag gcacacacca ccacacccgg 53940 

ctaatttttg tattttcaat agagatgggg tttcaccatg ttggccaggc tggtctcaaa 54000 

ctcctggcct caagtgatcc acctacctcg gcctcccaaa gtgccatgat tacagatgtg 54060 

agtcactgcg cccagtgagg gtcacatttc cgttgagatt tggaggggca gacgttggag 54120 

ccatctgagc cccctcgtcc cgctctagct tctcctcccg tgtgccccgc ggtgctggtg 54180 

gcaggccctt acgccggttc tggctgcatg ctctgttcca gaagctttct tccctgcttg 54240 

gttaccagaa aatcatccca tccattacaa ggacagggtc cccttatctc ccattcccag 54300 

ggcaggacac cgggggcagg gcaggtgggg aactgagcaa gttctctggg ggcaggcgtg 54360 

gctatggctc cctctgggtg ggcgtctggg gaggggtgga ggcagccgtc agcgccctgg 54420

cttgctcttc ctccctggcc agagactgtg gccttgtgct gctcccgtgt gggctgcctg 54480 

cacctccagt gggttgtgct ccctcccctc ccctcccctc aagctctgct gagcaccact 54540 

gccttccaca gcccccactc tcgggaggcg aggctcctcg tggccattcc tgtccttggc 54600 

acccaccccc ccaccaacct ggtagagcct tgggcggggt ctgttactcc ttgcatggcg 54660 

tagacctccc cacagtaggc acctgacaca tacctcctgg ggggcaggca ggaggtgcgt 54720 

tgaggtctca gccctggcag tccctcccct gcgtggcata ggcctcgcca cagggtcatc 54780

gagggtgggt ggagactgta ctagaccact ccccgctggt cctagaaagg gtcccatctg 54840

tctgctctct gtttggagtc cagaccttgg ttgctgtgcc ctgcatggtg ggctgggggg 54900 
caccctccag cctctctgag tgcatggcct ctccttgcag ccatctgcct gcccaaccag 54960

ttccggtgtg cgagcggcca gtgtgtcctc atcaaacagc agtgcgactc cttccccgac 55020

tgtatcgacg gctccgacga gctcatgtgt ggtgagccag cttctggcac ggggaagggg 55080

cgtccgggct gggttccccc aggaacgtgg agtttagggg aggagacgtg cctttccagc 55140 

ggggctgggg gctgtgtggg agactcaggc ggctgggagg ctccttgcgg gaggcaggga 55200

agcctttccc agggcagcgg ccaggaggac agactgtgag ctgtgggctc ggcggctaca 55260

gagtctgcct cagtgggcgg ggctgatggt gtccaggtgc ctgcagcacg cacccaccca 55320 

cgggaccttg ctgagcagcg tctgtcaggc agcaagatta cccgagggct gcagtggtcc 55380

tgttccctgg cagcttactg tctggctgag gaggagtgat gttcacatat gcacacatgt 55440 

catgtgcaca cacatgtaca tgacaacatc ccacatgctc ctcaaatagc atgacctgta 55500 

cagtcacgga tatagggcct aggggatagg aggccaagac agtcagggaa gactttccag 55560 

aggcagtggc tcctgaaagg ctgtctgatt caggcaggaa gggagctgag ttcagatagg 55620

aagtagcaat gagtcattgt gtctggggac atggccactc cttcgctgca gagggacctg 55680 

ggctgagagc tcctctctta tggctgcagt cgggagagaa gtctgttggg gggagaaggg 55740

ggcttcctca agggactccc tgtgcccttt ggcaccttcg tgccaggtca ggcttgaggc 55800 

ctgaaggcag tggtgggggc caccaagggt cgcctcctct gctgggcaag ttcccagtct 55860 

gacgggcctg tgccgtgggc cccagctgtg ggggcgctgt tgatgcgcag ccaggcctcg 55920

ccgccagagc ccgcacgctt ccattccgct gacttcatcg acgccctcag gatcgctggg 55980 

ccggccctgt gggagagtga atgtggcttt tgccaaagtt gagtctggag cctggaaact 56040

tccctatggg cagccttgat agtggagtgg cccaaggagc ccacccagcc gaccctgccc 56100 

ctcccgtggc tggtgggcgg caccaggggc tgcctggctt tgctcgttca ccaacatcac 56160 

ctgggctggc cagggcgcgc tcacttctgc caccaccgag ggccctgggc gaaggagtga 56220

ataccaggct gccttggcag ggatgtgttg agggctgtgg ggagtcggac agcggcgggg 56280 

gtcagaggag gaggagggtg caccgtgcag gctgaagggc cacgttaccc tgaggttggc 56340 

caggctcccc aggcctagcc tcccagctcc cccactttct ccccaccctc caccagtggc 56400 

aaagccagcc ccttcagggc gcacggtgtc tgcccccaag gagggcccat tccgttgggg 56460 

ttaatgttgg ccacctcttt ctgtttgtct ctggcagaaa tcaccaagcc gccctcagac 56520

gacagcccgg cccacagcag tgccatcggg cccgtcattg gcatcatcct ctctctcttc 56580 

gtcatgggtg gtgtctattt tgtgtgccag cgcgtggtgt gccagcgcta tgcgggggcc 56640

aacgggccct tcccgcacga gtatgtcagc gggaccccgc acgtgcccct caatttcata 56700 

gccccgggcg gttcccagca tggccccttc acaggtaagg agcctgagat atggaatgat 56760 

ctggaggagg caggagagta gtctgggcag ctttggggag tggagcaggg atgtgctacc 56820

ccaggccctc ttgcacatgt ggcagacatt gctaatcgat cacagcattc agcctttccc 56880 

actgagcctg tgcttggcat cagaatcctt caacacagag gcctgcatgg ctgtagcaac 56940 

ccaccctttg gcactgtagg tgtggagaaa gctccttgga cttgaccttc atattctagt 57000

aggacatgtg ctgtgttgtc cacaaatcct catgtaccct agaaatgaat gtgggggcgg 57060 

ctgggctctc tccagagctg aaggaatcac tctgtaccat acagcagctt tgtcttgagt 57120

gcagctggga tttgtggctg agcagttaca attcctacgt ggcccaggca ccaggaacgc 57180 

aggctgtgtt tgtagatggc tgggcagccg caccgcagag ctgcaccatg ctggtttgta 57240 

tcacatgggt gaccatggta tgtctaagaa ggtggagtcc ctgtgaggtc tgcaggtgcc 57300 

cccacagctc caggccacct tgaggattgc ctctgcctgc ccagccctga gttccctctc 57360 

ccctgtcctg tcccactgtc accccaagcc ggcctcattg ggagcctgtt ggatggcagg 57420

gtatagatgt aacctgattc tctctgggga gcggggttat ctggcttctc aagagctcct 57480

aggagcccac agtggtggca ccatcacagt cgcagcagcc cccagagaac gcggccctgt 57540

ctgttcctgg cgtgctctgt gctgccccgc ctgggttccc tgccccagtc gcaggcccct 57600

tggaggaggt accatgtgtc tcccgtttca cagatgagcc ccggggagct cactctagta 57660 

gtggccagag aggcctgcgg ctcagggagc ggggcacatt tccaacagga cacaccgccc 57720

tggtctgagt ctcgtgggta gtgggagcag aggagagcgc cctatgtctg tggggcggct 57780 

tggctgagcc tggaagccac ctgacctccc ccgtcccttc cctgccaggc atcgcatgcg 57840 

gaaagtccat gatgagctcc gtgagcctga tggggggccg gggcggggtg cccctctacg 57900 

accggaacca cgtcacaggg gcctcgtcca gcagctcgtc cagcacgaag gccacgctgt 57960 

acccgccggt gaggggcggg gccggggagg ggcggggcgg gatggggctg tgggcccctc 58020 

ccaccgtcag tgctggccac cggaggcttc ccgggttcct gggggctgtg ccaccgcctc 58080 

tgaggcatgc ttgctttctt cccttttcaa acccttctgc ttccttcttt aatgacattg 58140
ttgattgtgg ataatctgaa aactacacaa aaatataaag agccaaaatc tcacccaaat 58200

ccacctccta gagtggctgt tgggctccgt cagcatccag gcggccgtct gtgttccgca 58260 

cggcccagcc catcgatagc cgcctgcacc aggcctgtct gccctctgtg agcctcccca 58320

cagggttccc tccacaaaca ccctgttctc ccacccaggg ctggctgctt cctggaaaac 58380 

agctggatgg ttttgtgcat gacagacaaa cacagggtga ttttcgtggc taaaatactc 58440

cctggagctt ttggcagggt gaggggctgg ctccagctga gccacgcctt gagtgaaatg 58500 

actgtgagga gaataaactg ccgctgccct ccaggatcac tggggctggc tggggagaac 58560 

ccccgtttct gggagcacag tcccaggatg ccaaggcgag cttggtgccg agatgtgaac 58620

tcctgagtgt aaacagcggg ggctgacttg acatgctttg tatgcttttc atttgttcct 58680 

gcagctgtat gcccctaagg tgagtccagc ccccttctgc ttcctctggg gcctcgccag 58740 

tgagccccac cttgctgggg ctggttcctc ctgcccttct gggtatccct cacatctggg 58800 

gtcttgtctt cttgttttct ttttcttttt tttttgagac ggagtttcac ttttgttgcc 58860 

caggcttcag tgcaatggtg tgatctctag gctcaccgca acctctgcct cccaggttca 58920 

agcagttccc ctgcctcagc ctccctagta gctgggatta caggcatgtg ccaccacgcc 58980 

cagctaattt tgtattttta gtagagatgg ggtttctcca tgttggtcag gctgatcttg 59040 

aactccctac ctcaggtgat ccgcccacct tggcctccca aagtgctggg attacaggcg 59100 

tgagccaccg cacctggcct ttttcttttc ttttcttttc ttttttctga gacagggtct 59160 

cgctctgtca cccaggctgg agtgcaatgg tgtcatcatg gctaactgca gcctctacct 59220 

tctaggctca agcaatcctc ccatctcagc ccctaagtag ctaggactgc acgcatgcat 59280

ccccatgccc agctaatatt tacatttttt gtagagatga agtttcacta tattgcccag 59340 

gctggtctcc aactcctgga ctcgagcgat cctcctgcct cggcctcccc aggtgctggg 59400 

attacaggcg tgagccaccg tgcctggcct ggggtattgt cttcttatgg cacctgactg 59460 

tggtgggccc tgggaaggaa gtagcagaag agggttcttc ttggtttcct ggacagtaac 59520

tgagtgttct ggaggcccca gggcctggct ttgtttaggg acaaagggaa ctggtaacca 59580 

gaagccgaga gtttaaacac ccactgccct tcttccctgc tcctgctgct gcaacccagc 59640 

ttaaccagcc aggagtgcta ggaacccaag cagggccccc gagcacacag caggcagctc 59700 

acgaattctc ttttcctgtt ctcccttggg agctgggagg atcttaatca ggcaataaga 59760 

gatggcactg agcagccagc taatttttta aatcacttta ttgtttaacc atatgactca 59820 

cccacttaaa aaagggtaca gttcagtggg ttttagtgta ttcacagatg tgtgcaaccc 59880 

tcaccacagt taattttaga acattttcct gcccctaaaa gaaactctgc atgaagccag 59940

ctgtttttaa attagcaaag ttattttgca tcctttaaat atatgttcat ggtacaaaat 60000 

tcaaaagata cagaagagtc tgcagtccaa agagactccg cccccatgac gccaagcagg 60060

catccctggg aggcatggcc tcctgcagtg tgtttcttct atgtcccccc aggggtcatc 60120 

tgtacatatg caagcataca agagcgtgga ctttgttttc caagccagaa gataattgta 60180 

gatttatgtg cagttgtgag aaagagcaca gacccattta tcctctgcct ggtttccccc 60240 

agtgctgcct gccatcttgc atgacttcca ttcctatcat aagcaagaca ctgataacga 60300

ttctttcacc ttattcagat tgacataagt gttttttgtt tgttcttgag acaaacttcc 60360 

tctgtcaccc agtgggagtg cagtggcaca atcacagctc actgcagcct caaactcctg 60420

ggctcaagcg attctcctgc ctcagtcccc tcaagtagct cagatggcag gtgtgcacca 60480 

tcatgccagg ctaattttta aattttttgt ggaggtgagg cctcactaaa tttcctgggc 60540 

tagtcttgaa ctcctgagct aaagtgatcc tcctgcctca gcctcccaaa gtggtaggat 60600 

tacaggcatg agccactgcg cctgggctga catatgtgtt ttcgtaagcc cgaaagatag 60660 

catctgaaga gtcaacattg agccttgcct tttgctgcta acgatgtata aaagctgctg 60720 

ttctgagcat ttcggaggct cccagctgcc gtgtgcaccc tgcctagagc tctaccgtaa 60780 

cccatctccg ggaggaggtg ctattgtttt cctcattttg caacaaggag gctgaagaac 60840

tgagcatgaa ccactggcct gggtcgttcg gttggtaggc agtggggcca ggccatccaa 60900

ctcacaacca ccttctactc tgcttccccc gcaccctgaa gtttgttctg ttttgaggac 60960 

acagccgtca cattcttggt ggctgaacag cactccttgt caggcgtggc tgggccccca 61020

ctggagggca tcatggtcct ctctcctgct gcggttgaac cttggctgtt tcaaccactc 61080 

ctgccaagtg gccctctgaa agggacagtc catcttttct cagcagaggg ccacactggc 61140 

aaaacggtcc ctggcaccct ttctctccac ctgtctaata tagagtaaaa atggtatcat 61200

gttaagatct tcatttatat ttattttatc atgaatgatg taagcatcat tttgtgtgtt 61260

taagaacctt tgggcccagc gtgatggctt gcagctgtaa tctcagcact ttaggaggct 61320 

gagatgagcg gatcacttga ggccgggagt ttgagaccag cctggccaac atggagaaac 61380
cccgtctcta gtaaaaattt aaaaattagc cgggtatggt gatcccagct acttgggagt 61440

ctgaagcatg agaattgctt gaacatggga ggcggaggtt gcagtgagcc gagatcgcgc 61500 

cattgcactc cagcctgggc gacagagcga gactctgtct caaaaaaaaa aaaaaaaaag 61560 

aaaagaaaag aaattatcaa tctcctcttt tatggcatat atatatatat atatatatat 61620 

atatatatat atatatattt tttttttttg gttatgttca gaaaggcctt ccctgctctg 61680 

atcataaaaa acaacttatt ttcacactct ctctcttttt tttttgagac agagttttgc 61740 

tcctgttgcc caggctggag tgcagtggcg caatctcagc tcactgtaac ctccgcctcc 61800 

cgggttggag tgattctcct gccttacctt cccgagtagc tgggattata ggcatgcacc 61860 

accatgcctg gctaattttg tacttttagt agagacgggg gtttctccat gttggtcagg 61920 

ctggtctcga actcgcgacc tcaggtgatc cacccacctc ggcctcccaa agtgctggga 61980 

ttacagacgt gagccaccat gcccagccca cactctcttt cttaacgtcc tcctcctttc 62040 

gttttacgtt cacatcttta attcttctgg gatgtaatta gatttgatga gcaaggtggg 62100

catccagctt gtttcttggc tgatggctta tgggtggcgt gaattagtcg gggtctatca 62160 

ggaggcagaa actctatgag aatttgaaca gagaaagttc cgtctacagg cttattacca 62220 

gggactggaa tagcagaaat tgaacagtga gatgtacaga gaactctaag aatgcaggaa 62280

taggccaggc atggtggctc acacctgtca tcccagcact ttgggagacc aaggcgggtg 62340 

gatcacctga ggtcaggagt tcgagaccag cctggccaac atagtgaaac cccatctcta 62400 

ctaaaaatac aaaaaaatta gctgggtgtg gtggcgcatg cctgtaatcc cagctactcg 62460 

ggaggctgag gcaggagaat cacttgaacc tgggaggcag aggttgcagt gagccgagat 62520 

catgccactg tactccagcc tgggtggaag agcggaactc tgtctgaaaa aaaaaaaaaa 62580 

aacaagaagt tcaacttgaa gggaaaaatg ccgtattgtc tttccctttg ttatgtcacc 62640 

agggcacagt ccatcccagg ctggcgctga tccacgggct ggagaggggc tgccccagaa 62700 

gaggacatgc caggaagggc ttggctggtg ttcaggagcc caggccaggt caggtcaaga 62760 

ggtgttgagg ctggacggga gaggccagct aggggctcat gtaggatatg aggggtcggc 62820

ccatttcaac gtggaaactg agctcttctg cttctctttc ttcttcactg cattaagatt 62880 

caataccgct tgggaagcag gtatttccct tcctataaag gatggttggg agcctgagtg 62940 

ttgggagaaa gtgtagccgc tgagttacta acaactaggg ctgccgtcaa gcctatgggg 63000 

aaagagagaa gaggacattt ggaaggagag agatcaagct gtggcaccct gggagaggac 63060

cacagaaaag aggccagtga gggggttccc cggtggcatc tgaaggtgtg gcccaaccag 63120

gaggtccaga ggctgccagc cgagtggccc aggagaggga acctcacagg ggctgagtgg 63180 

gacccaagcc ctatccaccg tcctaaccac ccacatttct cgggaacaag acctcccaca 63240 

gtggcctccc cggcagtgga aatagccaaa ctggcaacat ggactttctt caactgcccg 63300

ggcgatgctg cctcagtgcc ccagggcagg caggaagctc ccacacccat tctggaatga 63360 

ggggttggag gaaggctgag ctgagcaaag gacccatctc tgctctggtt ggtggggagg 63420

gagcccatta tacaagagac ccctcagggc tcagtgaggg gtgacagaga cttggggagt 63480

agtggctgtc actgcagagg tgagagggtt tggagagaag gtacatgcct ttttggccac 63540 

attgagtagc acctggtagc cagttagtaa cgtgtattgg ataaacaaaa gattaaacgg 63600 

atgcaaaaaa aaatgttggc tttgcttctt tttacccaaa cctcagttcc ctcaagtaga 63660 

ttctgggaac accccctacc tggctggact gttgtgaagt ttaaataagc caggttaact 63720

tcacctcctc ctttaagaca cagctcagac actgcctcct ccaagaagcc ccctctggct 63780 

tcctgtgtga atatgacggc cctctgggct ctagggtatc ttagaacaat gcttccttat 63840

ggctttggaa ccccgctgtc tcctggattg ggagcaaatg caggggagga gccacacctg 63900

actaatctct gggtctccca gcacataagt ggcataaggg cagggctgtg cccgcttcag 63960 

gcacttactg aaggatgtac ttggcagagg gtaggcagcc ggcggatgag cccctcactc 64020

tccccagctg actgcgtggg cgggaaaggc gggttcagga gacccagcct ccctgggctg 64080

tcaccacctc tgcacatcca gccccattga tcaagggttc aatttttggg gtcctgttgg 64140

gaggccagga gactctctcc aggcacttct tccaggtctt tgtgttaggg tgtgtgtgtg 64200

tgtgtgtgtg tgtgtgtgtg tgtgttgttt gttttatttt atttatttat ttatttattt 64260 

atttatttat ttatttattt tgagacgcag tctcgctctg ttgcccaggt tggagggtgg 64320

tggcatgatc tcggctcact gcaagctccg cctcccgggt tcacgccatt ctcctgcctc 64380

actcttcctg agtagccgga ttacaggcgc acgcaccatg cctggctaat tattttgttt 64440 

ttttagtaga gacagggttt cgccacgttg cccaggctgg tcttgaatcc ctggcctcaa 64500 

gcgatccgcc cgcctcagcc tcccaaagtg ctgggattac aggcgtgagc caccgtgccc 64560 

gcccagccta ggggtacatg aaactttttt tttttttttt ttgagacaga gtttcactct 64620
gtcctcaggc tggagtgcag tggcgtgatc tcggcgtact gcaatctccg cctcccggtt 64680

caagcgattc tcctgcctca gcctcccgag tagctgggat tgcaggcacg cgccaccaca 64740 

cccagctaat ttttgtattt ttagtagaga cgggctttca ccatgtggga caggatggtc 64800 

tcgatctcct gacctcgtga tccgcccgcc tcagcctccg aaagtgctgg gattacaggc 64860 

ctgagccacc gtgcccagcc atgatgtttt gatacaggca tataacgtat aataatcaca 64920 

tcagggtaaa tgatgtaacc atcacatcaa gcatttatcc tttgtgttac aaaaaaaaat 64980 

ctaattatac tttcctactt attctttttt tttttttttt ttgagacgga gtctccctca 65040 

gtcgcccagg ctggagtgca gtggcatgat ctcagttcac tgcaagctct gcctcctagc 65100 

tctgcctcct gggttcatgc cattctcctg tctcagcctc gcgagtagct gggactacag 65160 

gcgcctgcca ccgtgcccgg ctaatttttt tttttgtatt tttggtagag acagggtttc 65220 

accgtgttag ccaggatggt ctcgatctcc tgacctcata atccgcccgt ctcggcctcc 65280

caaagtgctg ggattacagg catgagccac cgcccccagc ctatttattc ttaaatgtac 65340 

aataaattat tgttgactcc agtcaccctg ctgtgctacc aaatacggat cttcttcatt 65400 

ctatctaact gtatttctgt acctgttaac catctctcct ccacctcacc ccccaaaccc 65460 

actacccttc tcagcctctg gtaaccatcc ttctactctc tatctctatg agttcaattg 65520 

tattaatttt tagctccccg gccgggcacg gtggctcacg cctgtaatcc cagcacttca 65580 

ggaggctgag gcaggtggat cacgaggtca ggagtttgag accagcctgg ccaacatggt 65640

ggaaccccat ctctactaaa aacacaaaaa ttagctgggc gtggtggtgg gcgcttgtag 65700 

tcccagctac ttgggaggct gaggcaggag aatcgcttga aactgggagg cagaggttgc 65760 

agtgagccaa gattgcgcca ctgcactcca gtctgggtga cagagtaaga ttccatcccg 65820

aaaaaaaaaa agtttagctc ccacaaataa gtgagaacac gtgaagtttc tctttctgtg 65880 

cctcgcttgt ttcacttaac ataatgacct ccagttccat ccacgttgtt gctttgttat 65940 

aaatgacagg atcttggtca ggcgcagtgg ctcatgcctg taatcccagc actttgggag 66000 

gctgaggtgg actgatcatg aggtcaagag atcgagacca tcctggctaa cacagtgaaa 66060 

ccccgtctct actaaaaata caagaaatta gccgggcgtg gtggtgggca cccatttccg 66120 

ccccttctcg ggacgctgat gcacgacata ttacccatcc ccggaagact aatcctcccc 66180 

cactctatat tgtacctctt cctttctcct ccacgcgatt ccccgagtaa cccgtcttcc 66240 

ctccctcctc ggattacgct cacctttccg cttcaatcac gttgctccgt ccccttcccc 66300 

attcgtacca ctcctcactt tcgtcttcct acccccacta tcccttttcg tcctctctat 66360 

tccttactta ctcctccccc ttctcttcat acttcattcc ctccgctctt cccactcgcg 66420 

ctcccacttt cacctagttg ccctcaccta cgttgccatc tcgccccttc ttcagctctc 66480 

ggcctctcac ccatctgtcc tctctcttac ctctctcctc atctcgctca gacatctctc 66540 

tagactatcc ctcactttac cttctcagtc gtcttcttcc tatccttcgt tctccatgat 66600 

cttcacgtcg ccatctcttt tcgccccttt catatgtctc tcttcatgtt ctcactatca 66660 

ttctcatgat cactatcgtt ctcactactt atcactcccc tctttcttca tcaattcctc 66720

tccgtcattc tcgtctctct cttacaaccg ccttccttgt gctatctaac tcaaccatgc 66780 

ctctcctact ctctctctat cgcccctcca tcgcttatgc atcctcttct attgcacacc 66840 

cgcccctcca tcgcttatgc atcctcttct attgcacacc gcccctccat cgcttatgca 66900 

tcctcttcta ttgcacatcc tcttctattg cac 66933 

<210> 12 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 12 

ctgagcggaa ttcgtgagac c 21 

<210> 13 
<211> 23 
<212> DNA 
<213> Artificial Sequence



<220> 

<223> Artificial sequence is a primer. 



<400> 13

ttggtctcac gtattccgct cga 23



<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 14 

ctcgagaatt ctggatcctc 20

<210> 15
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> Artificial sequence is a primer.
<400> 15

ttgaggatcc agaattctcg ag 22

<210> 16
<211> 21
<212> DNA

<213> Artificial Sequence
<220>

<223> Artificial sequence is a primer.
<400> 16

tgtatgcgaa ttcgctgcgc g 21

<210> 17
<211> 23
<212> DNA

<213> Artificial Sequence
<220>

<223> Artificial sequence is a primer.
<400> 17

ttcgcgcagc gaattcgcat aca 23



<210> 18 
<211> 21 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 18 

gtccactgaa ttctcagtga g 21 

<210> 19 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 

<400> 19 

ttgtcactga gaattcagtg gac 23 

<210> 20 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 

<400> 20 

gaatccgaat tcctggtcag c 21 

<210> 21 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 

<400> 21 

ttgctgacca ggaattcgga ttc 23 



<210> 22 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 

<400> 22 

cuacuacuac uactgagcgg aattcgtgag acc 33

<210> 23
<211> 32
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 23 

cuacuacuac uactcgagaa ttctggatcc tc 32 

<210> 24 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 24 

cuacuacuac uatgtatgcg aattcgctgc gcg 33 

<210> 25 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 25 

cuacuacuac uagtccactg aattctcagt gag 33 

<210> 26 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 26 

cuacuacuac uagaatccga attcctggtc agc 33

<210> 27 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 27 

aactggaaga attcgcggcc gcaggaattt tttttttttt ttttt 45 
<210> 28

<211> 13 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Artificial sequence is a primer. 



<400> 28 

aattcggcac gag 13 

<210> 29 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Artificial sequence is a primer. 



<400> 29 
ctcgtgccg 9

<210> 30 
<211> 14 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Artificial sequence is a primer. 



<400> 30 

gtacgacggc cagt 14 

<210> 31 

<211> 16 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Artificial sequence is a primer. 
<400> 31 

aacagctatg accatg 16 



<210> 32 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 32 

ccaagttctg agaagtcc 18 
<210> 33
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 33 

aatacctgaa accatacctg 20 

<210> 34 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 34 

agctgctcgt agctgtctct ccctggatca cgggtacatg tactggacag actgggt 57 

<210> 35 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 35 

tgagacgccc ggattgagcg ggcagggata gcttattccc tgtgccgcat tacggc 56 

<210> 36 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 36 

agctgctcgt agctgtctct ccctgga 27 

<210> 37 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 37 

gccgtaatgc ggcacaggga ataagct 27 
<210> 38
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 38 

gagaggctat atccctgggc 20

<210> 39
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Artificial sequence is a primer.
<400> 39

acagcacgtg tttaaagggg 20

<210> 40

<211> 163

<212> DNA

<213> Homo sapiens



<400> 40 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagct cggcgcgggc ccgtccggcc 60 

gccggacaac atggaggcag ctccgcccgg gccgccgtgg ccgctgctgc tgctgctgct 120 

gctgctgctg gcgctgtgcg gctgcccggc ccccgccgcg gcc 163 

<210> 41 

<211> 419 

<212> DNA 

<213> Homo sapiens 

<400> 41 

gccccacagc ctcgccgctc ctgctatttg ccaaccgccg ggacgtacgg ctggtggacg 60 

ccggcggagt caagctggag tccaccatcg tggtcagcgg cctggaggat gcggccgcag 120 

tggacttcca gttttccaag ggagccgtgt actggacaga cgtgagcgag gaggccatca 180 

agcagaccta cctgaaccag acgggggccg ccgtgcagaa cgtggtcatc tccggcctgg 240 

tctctcccga cggcctcgcc tgcgactggg tgggcaagaa gctgtactgg acggactcag 300

agaccaaccg catcgaggtg gccaacctca atggcacatc ccggaaggtg ctcttctggc 360 

aggaccttga ccagccgagg gccatcgcct tggaccccgc tcacgggtaa accctgctg 419 

<210> 42 
<211> 221 
<212> DNA 

<213> Homo sapiens 
<400> 42 

ccccgtcaca ggtacatgta ctggacagac tggggtgaga cgccccggat tgagcgggca 60 
gggatggatg gcagcacccg gaagatcatt gtggactcgg acatttactg gcccaatgga 120 
ctgaccatcg acctggagga gcagaagctc tactgggctg acgccaagct cagcttcatc 180
caccgtgcca acctggacgg ctcgttccgg taggtaccca c 221 

<210> 43 
<211> 221 
<212> DNA 

<213> Homo sapiens 
<400> 43 

tccctgactg caggcagaag gtggtggagg gcagcctgac gcaccccttc gccctgacgc 60 
tctccgggga cactctgtac tggacagact ggcagacccg ctccatccat gcctgcaaca 120
agcgcactgg ggggaagagg aaggagatcc tgagtgccct atactcaccc atggacatcc 180 
aggtgctgag ccaggagcgg cagccttttt gtgagtgccg g 221 

<210> 44 
<211> 156 
<212> DNA 

<213> Homo sapiens 
<400> 44 

tttctcagtc cacactcgct gtgaggagga caatggcggc tggtcccacc tgtgcctgct 60 
gtccccaagc gagccttttt acacatgcgc ctgccccacg ggtgtgcaga tgcaggacaa 120 
cggcaggacg tgtaaggcag gtgaggcggt gggacg 156 

<210> 45 
<211> 416 
<212> DNA 

<213> Homo sapiens 
<400> 45 

ctccacagga gccgaggagg tgctgctgct ggcccggcgg acggacctac ggaggatctc 60 

gctggacacg ccggacttca ccgacatcgt gctgcaggtg gacgacatcc ggcacgccat 120

tgccatcgac tacgacccgc tagagggcta tgtctactgg acagatgacg aggtgcgggc 180 

catccgcagg gcgtacctgg acgggtctgg ggcgcagacg ctggtcaaca ccgagatcaa 240 

cgaccccgat ggcatcgcgg tcgactgggt ggcccgaaac ctctactgga ccgacacggg 300 

cacggaccgc atcgaggtga cgcgcctcaa cggcacctcc cgcaagatcc tggtgtcgga 360

ggacctggac gagccccgag ccatcgcact gcaccccgtg atggggtaag acgggc 416 

<210> 46 
<211> 198 
<212> DNA 

<213> Homo sapiens 
<400> 46 

ttcttctcca gcctcatgta ctggacagac tggggagaga accctaaaat cgagtgtgcc 60 
aacttggatg ggcaggagcg gcgtgtgctg gtcaatgcct ccctcgggtg gcccaacggc 120 
ctggccctgg acctgcagga ggggaagctc tactggggag acgccaagac agacaagatc 180 
gaggtgaggc tcctgtgg 198 

<210> 47 

<211> 244 

<212> DNA 

<213> Homo sapiens 
<400> 47

ccgtcctgca ggtgatcaat gttgatggga cgaagaggcg gaccctcctg gaggacaagc 60 

tcccgcacat tttcgggttc acgctgctgg gggacttcat ctactggact gactggcagc 120 

gccgcagcat cgagcgggtg cacaaggtca aggccagccg ggacgtcatc attgaccagc 180 

tgcccgacct gatggggctc aaagctgtga atgtggccaa ggtcgtcggt gagtccgggg 240 

ggtc 244 

<210> 48 

<211> 313 

<212> DNA 

<213> Homo sapiens 



<400> 48 

gttcgcttcc aggaaccaac ccgtgtgcgg acaggaacgg ggggtgcagc cacctgtgct 60

tctgcacacc ccacgcaacc cggtgtggct gccccatcgg cctggagctg ctgagtgaca 120

tgaagacctg catcgtgcct gaggcctttt tggtcttcac cagcagagcc gccatccaca 180

ggatctccct cgagaccaat aacaacgacg tggccatccc gctcacgggc gtcaaggagg 240

cctcagccct ggactttgat gtgtccaaca accacatcta ctggacagac gtcagcctga 300

aggtagcgtg ggc 313



<210> 49 

<211> 255 

<212> DNA 

<213> Homo sapiens 



<400> 49 

cctgctgcca gaccatcagc cgcgccttca tgaacgggag ctcggtggag cacgtggtgg 60 

agtttggcct tgactacccc gagggcatgg ccgttgactg gatgggcaag aacctctact 120 

gggccgacac tgggaccaac agaatcgaag tggcgcggct ggacgggcag ttccggcaag 180 

tcctcgtgtg gagggacttg gacaacccga ggtcgctggc cctggatccc accaaggggt 240 

aagtgtttgc ctgtc 255 

<210> 50 

<211> 210 

<212> DNA 

<213> Homo sapiens 

<400> 50 

gtgccttcca gctacatcta ctggaccgag tggggcggca agccgaggat cgtgcgggcc 60 

ttcatggacg ggaccaactg catgacgctg gtggacaagg tgggccgggc caacgacctc 120 

accattgact acgctgacca gcgcctctac tggaccgacc tggacaccaa catgatcgag 180 
tcgtccaaca tgctgggtga gggccgggct 210 



<210> 51 

<211> 352 

<212> DNA 

<213> Homo sapiens 



<400> 51 

gtgttcatgc aggtcaggag cgggtcgtga ttgccgacga tctcccgcac ccgttcggtc 60 

tgacgcagta cagcgattat atctactgga cagactggaa tctgcacagc attgagcggg 120 

ccgacaagac tagcggccgg aaccgcaccc tcatccaggg ccacctggac ttcgtgatgg 180 

acatcctggt gttccactcc tcccgccagg atggcctcaa tgactgtatg cacaacaacg 240

ggcagtgtgg gcagctgtgc cttgccatcc ccggcggcca ccgctgcggc tgcgcctcac 300 
actacaccct ggaccccagc agccgcaact gcagccgtaa gtgcctcatg gt 352

<210> 52 

<211> 225 

<212> DNA 

<213> Homo sapiens 

<400> 52 

gcctcctcta cgcccaccac cttcttgctg ttcagccaga aatctgccat cagtcggatg 60 

atcccggacg accagcacag cccggatctc atcctgcccc tgcatggact gaggaacgtc 120 

aaagccatcg actatgaccc actggacaag ttcatctact gggtggatgg gcgccagaac 180

atcaagcgag ccaaggacga cgggacccag gcaggtgccc tgtgg 225 

<210> 53 

<211> 235 

<212> DNA 

<213> Homo sapiens 

<400> 53 

ctttgtctta cagccctttg ttttgacctc tctgagccaa ggccaaaacc cagacaggca 60 

gccccacgac ctcagcatcg acatctacag ccggacactg ttctggacgt gcgaggccac 120

caataccatc aacgtccaca ggctgagcgg ggaagccatg ggggtggtgc tgcgtgggga 180 

ccgcgacaag cccagggcca tcgtcgtcaa cgcggagcga gggtaggagg ccaac 235 

<210> 54 

<211> 218 

<212> DNA 

<213> Homo sapiens 

<400> 54 

ccaccctccc gcaggtacct gtacttcacc aacatgcagg accgggcagc caagatcgaa 60 

cgcgcagccc tggacggcac cgagcgcgag gtcctcttca ccaccggcct catccgccct 120

gtggccctgg tggtggacaa cacactgggc aagctgttct gggtggacgc ggacctgaag 180

cgcattgaga gctgtgacct gtcaggtacg cgccccgg 218 

<210> 55 

<211> 234 

<212> DNA 

<213> Homo sapiens 

<400> 55 

ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 60 

gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 120 

gtgtggagaa gaccaccggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 180 

ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggc 234 

<210> 56 

<211> 157 

<212> DNA 

<213> Homo sapiens 

<400> 56 

ttgtctttgc agcagcccac ccatgtgccc gtgacaatgg tggctgctcc cacatctgta 60 
ttgccaaggg tgatgggaca ccacggtgct catgcccagt ccacctcgtg ctcctgcaga 120 
acctgctgac ctgtggaggt aggtgtgacc taggtgc 157



<210> 57 

<211> 272 

<212> DNA 

<213> Homo sapiens 



<400> 57 

gttctcctct gtccctcccc cagagccgcc cacctgctcc ccggaccagt ttgcatgtgc 60 

cacaggggag atcgactgta tccccggggc ctggcgctgt gacggctttc ccgagtgcga 120

tgaccagagc gacgaggagg gctgccccgt gtgctccgcc gcccagttcc cctgcgcgcg 180

gggtcagtgt gtggacctgc gcctgcgctg cgacggcgag gcagactgtc aggaccgctc 240

agacgaggtg gactgtgacg gtgaggccct cc 272

<210> 58 
<211> 134 
<212> DNA 

<213> Homo sapiens 



<400> 58 

tctccttgca gccatctgcc tgcccaacca gttccggtgt gcgagcggcc agtgtgtcct 60 

catcaaacag cagtgcgact ccttccccga ctgtatcgac ggctccgacg agctcatgtg 120
tggtgagcca gctt 134 

<210> 59 

<211> 274 

<212> DNA 

<213> Homo sapiens 



<400> 59 

gtttgtctct ggcagaaatc accaagccgc cctcagacga cagcccggcc cacagcagtg 60 

ccatcgggcc cgtcattggc atcatcctct ctctcttcgt catgggtggt gtctattttg 120 

tgtgccagcg cgtggtgtgc cagcgctatg cgggggccaa cgggcccttc ccgcacgagt 180 

atgtcagcgg gaccccgcac gtgcccctca atttcatagc cccgggcggt tcccagcatg 240 

gccccttcac aggtaaggag cctgagatat ggaa 274 

<210> 60 
<211> 164 
<212> DNA 

<213> Homo sapiens 
<400> 60 

cttccctgcc aggcatcgca tgcggaaagt ccatgatgag ctccgtgagc ctgatggggg 60 
gccggggcgg ggtgcccctc tacgaccgga accacgtcac aggggcctcg tccagcagct 120 
cgtccagcac gaaggccacg ctgtacccgc cggtgagggg cggg 164 

<210> 61 
<211> 130 
<212> DNA 

<213> Homo sapiens 



<400> 61 

ttggctctcc tcagatcctg aacccgccgc cctccccggc cacggacccc tccctgtaca 60 
acatggacat gttctactct tcaaacattc cggccactgc gagaccgtac aggtaggaca 120
tcccctgcag 130



<210> 62 

<211> 496 

<212> DNA 

<213> Homo sapiens 

<400> 62 

tcaaacattc cggccactgc gagaccgtac aggccctaca tcattcgagg aatggcgccc 60 

ccgacgacgc cctgcagcac cgacgtgtgt gacagcgact acagcgccag ccgctggaag 120 

gccagcaagt actacctgga tttgaactcg gactcagacc cctatccacc cccacccacg 180 

ccccacagcc agtacctgtc ggcggaggac agctgcccgc cctcgcccgc caccgagagg 240 

agctacttcc atctcttccc gccccctccg tccccctgca cggactcatc ctgacctcgg 300

ccgggccact ctggcttctc tgtgcccctg taaatagttt taaatatgaa caaagaaaaa 360

aatatatttt atgatttaaa aaataaatat aattgggatt ttaaaaacat gagaaatgtg 420

aactgtgatg gggtgggcag ggctgggaga actttgtaca gtggagaaat atttataaac 480

ttaattttgt aaaaca 496 



<210> 63
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> Artificial Sequence is a primer.
<400> 63

ttttgggtac acaattcagt cg 22

<210> 64
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Artificial Sequence is a primer.
<400> 64

aaaactgtgg gtgcttctgg 20

<210> 65
<211> 21
<212> DNA

<213> Artificial Sequence
<220>

<223> Artificial Sequence is a primer.
<400> 65

gtgattgagc caatcctgag a 21



<210> 66 
<211> 21 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 
<400> 66 

tgagccaaat aaaccccttc t 21 

<210> 67 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 67 

ctggactacg tggccttctc 20 

<210> 68 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 68 

ttcagaagca cttggctgg 19 

<210> 69 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 69 

ctcagtgcca tgaagatgga 20

<210> 70 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 70 

caagatcact cgatctccag g 21 

<210> 71 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 71 

gtttcaggag actcagagtc 20 

<210> 72 

<211> 20 

<212> DNA 

<213> Homo sapiens 
<400> 72

ttctgcaggt tgctgttgag 20 

<210> 73 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 73 

ttattgtgat ttcccgtggc 20

<210> 74 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 74 

gccctctgtc ctgacttcag g 21 

<210> 75 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 75 

gagaaagaaa taaggggacc 20 

<210> 76 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 76 

tgctttgtaa agcactgaga 20

<210> 77 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 77 

gaagtacggg cagttcagtg gcct 24 

<210> 78 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 78 

atacaccaag gtccatgttc cccgt 25 

<210> 79 
<211> 25 
<212> DNA 
<213> Homo sapiens
<400> 79 

agcctgggcc acagcgtgag actac 25 

<210> 80 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 80 

tcccggagct tgcacacccg cttca 25 

<210> 81 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 81 

catgtgccca cctcattcat 20 

<210> 82 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 82 

caagattctg tagcttctgg 20

<210> 83 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 83 

cagagaagtc aagggacttg 20 

<210> 84 

<211> 20 

<212> DNA 

<213> Homo sapiens

<400> 84 

atcctctcac atcccacact 20

<210> 85 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 85 

caaggctaaa agacgaaaaa 20

<210> 86

<211> 20

<212> DNA

<213> Homo sapiens

<400> 86

tcaggagcat ttcatctttt 20

<210> 87
<211> 19
<212> DNA

<213> Homo sapiens
<400> 87

aagtcgaggc tgcaaggag 19

<210> 88

<211> 20

<212> DNA

<213> Homo sapiens

<400> 88

gccctgtgtt cctttcagta 20

<210> 89

<211> 19

<212> DNA

<213> Homo sapiens

<400> 89

aaggtgtgag gatcactgg 19

<210> 90

<211> 17

<212> DNA

<213> Homo sapiens

<400> 90

agctcatggg ggctatt 17

<210> 91

<211> 20

<212> DNA

<213> Homo sapiens

<400> 91

gcttctccga gtgtatcaac 20

<210> 92
<211> 20
<212> DNA

<213> Homo sapiens
<400> 92

atggcagagg acttagaaca 20

<210> 93

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 93 

gatcagcgaa cttcctctcg gctc 24 

<210> 94 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 94 

tccacattga ggactgtggg aacg 24 

<210> 95 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 95 

gctaatcaca gtctaaccga 20 

<210> 96 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 96 

ttgcactgtc ttggatgca 19 

<210> 97 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 97 

gcacagctgt agtggggttc taggc 25 

<210> 98 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 98 

caggcgcaaa ggacatgcac acggc 25 

<210> 99 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 99 
caccgatgag tgcacgttca aggag 25

<210> 100 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 100 

cagacagaga tgctccacgc catac 25 

<210> 101 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 101 

tttctgggtg tgtctgaat 19 

<210> 102 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 102 

acacagttgc tctaaagggt 20

<210> 103 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 103 

catttgggaa atccagaaga 20

<210> 104 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 104 

taggtgtctt attttttgtt gcttc 25 

<210> 105 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 105 

gacataccat gaacactata agagg 25

<210> 106 

<211> 20 

<212> DNA 

<213> Homo sapiens 
<400> 106

caacccatac cagggataag 20

<210> 107 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 107 

gaacaagagg ggtaagttgg c 21 

<210> 108 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 108 

tgaggacaca gatactgatg gg 22 

<210> 109 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 109 

gaagtgttcc ctcttaaatt ctttg 25 

<210> 110 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 110 

gaactatatt gtagttagtg aggag 25 

<210> 111 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 111 

cctgtaaccc ccagtccc 18

<210> 112 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 112 

tcttgcttcc taagtttctc gg 22 



<210> 113
<211> 21
<212> DNA
<213> Homo sapiens

<400> 113
actccatcca cctcatcact g 21

<210> 114
<211> 20
<212> DNA
<213> Homo sapiens

<400> 114
tgctgtttgc ctcatctgac 20



<210> 115 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 115 

gtggacaggc atagctgagg 20

<210> 116 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 116 

tgttcactct tctgcctgca g 21 

<210> 117 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 117 

agctggactc tcacagaatg 20 

<210> 118 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 118 

caagaggctg gtagaaggtg 20 

<210> 119 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 119 

gactccagtc tgggcaataa aagc 24 
<210> 120
<211> 22
<212> DNA
<213> Homo sapiens

<400> 120
ggtggcagca tgacctctaa ag 22



<210> 121 
<211> 16 
<212> DNA 

<213> Homo sapiens 
<400> 121 

caggcccagt ctcttg 16 

<210> 122 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 122 

cgtgtccaga tgaaagtg 18 

<210> 123 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 123 

acctcacggt gtaatccc 18 

<210> 124 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 124 

cttgaagccc atctttgc 18 

<210> 125 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 125 

tatttgcaaa gcttgagact tct 23 

<210> 126 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 126 

aatcactgtg ctttgttgcc 20 
<210> 127

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 127 

actttattgt cagcgtgggc 20 

<210> 128 

<211> 18 

<212> DNA 

<213> Homo sapiens

<400> 128 

actccctcga tggcttcc 18 

<210> 129 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 129 

gagcagggga gagaaggc 18 

<210> 130 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 130 

cccactggct tgttttattg 20

<210> 131 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 131 

agccacttta ttgttatttt gatgc 25 

<210> 132 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 132 

aagagtgaac aaaagcaaac atacc 25 

<210> 133 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 133 
gtggagtgtg ggattggg 18

<210> 134 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 134 

tactgttctt gataagtatg tcggc 25 

<210> 135 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 135 

atgcttttgc atgattctaa ttatt 25 

<210> 136 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 136 

tcccccaaaa gaatgtaaag g 21 

<210> 137 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 137 

ctggtcttcc ttgtgtgctg 20 

<210> 138 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 138 

atcacccagg ccagggat 18 

<210> 139 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 139 

tcagaagcag aactgttttt aaca 24 

<210> 140 

<211> 22 

<212> DNA 

<213> Homo sapiens 
<400> 140

cctgcttgaa agttctagag cc 22 

<210> 141 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 141 

caagcccggg ttttattgaa a 21 

<210> 142 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 142 

gatgccagga ccatggac 18 

<210> 143 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 143 

gcatatagaa acaatttatt gccg 24 

<210> 144 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 144 

ctctgaagca gggaccagag 20

<210> 145 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 145 

ctaccacacc acaccaggc 19 

<210> 146 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 146 

caagcgaaag ctgccttc 18 

<210> 147 
<211> 23 
<212> DNA 
<213> Homo sapiens
<400> 147 

gttgtcttga cttcaggtct gtc 23 

<210> 148 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 148 

ttttccttca acaatcacta ctcc 24 

<210> 149 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 149 

gcgtggggat atagaggtca 20

<210> 150 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 150 

tacgtggcca agaagctagg 20 

<210> 151 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 151 

taatatatcc ccagtctaag gcat 24 

<210> 152 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 152 

agcttgcaga tggagccc 18 

<210> 153 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 153 

tggttttaaa cctttaatga gaaaa 25 
<210> 154

<211> 23

<212> DNA

<213> Homo sapiens

<400> 154

tgttgatcta taccctgttt ccg 23

<210> 155
<211> 25
<212> DNA

<213> Homo sapiens
<400> 155

aattatttaa aagagaggaa aggca 25

<210> 156

<211> 20

<212> DNA

<213> Homo sapiens

<400> 156

tggctgtgaa cttcctctga 20

<210> 157

<211> 25

<212> DNA

<213> Homo sapiens

<400> 157

ggttacagaa aaacatttga gagat 25

<210> 158

<211> 22

<212> DNA

<213> Homo sapiens

<400> 158

tgagctttag ttcccttctc tg 22

<210> 159

<211> 23

<212> DNA

<213> Homo sapiens

<400> 159

ttgaaaaacc atttatttca ccg 23

<210> 160
<211> 18
<212> DNA

<213> Homo sapiens
<400> 160

tctgcggctg ttggattt 18

<210> 161

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 161 

ttgaaaaacc atttatttca ccg 23 

<210> 162 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 162 

tgttctcttc tcccagcagg 20 

<210> 163 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 163 

ctttattgaa aacattgagt gca 23 

<210> 164 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 164 

ttgtcaaatt ccccccaaaa 20 

<210> 165 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 12 

<223> n = A,T,C or G
<400> 165 

aaaccacgac cnccaa 16 

<210> 166 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 166 

ccctggaaag gtaagatgct 20

<210> 167

<211> 23
<212> DNA

<213> Homo sapiens
<400> 167

cttttggtag agacaaggtc tca 23

<210> 168
<211> 25
<212> DNA

<213> Homo sapiens
<400> 168

tatctgtctg tagtgcttca aatgt 25

<210> 169
<211> 19
<212> DNA

<213> Homo sapiens
<400> 169

gacgaaggtg attcagggc 19

<210> 170
<211> 19
<212> DNA

<213> Homo sapiens
<400> 170

actgaagaac tcttgtcct 19

<210> 171
<211> 25
<212> DNA

<213> Homo sapiens
<400> 171

cagataaaag agtcactatg gctca 25

<210> 172
<211> 20
<212> DNA

<213> Homo sapiens
<400> 172

cacttctccc actttgtccc 20

<210> 173
<211> 25
<212> DNA

<213> Homo sapiens
<400> 173

ttattgataa gcattagtga acccc 25
<210> 174

<211> 20

<212> DNA

<213> Homo sapiens

<400> 174

tggcaagtta ggcacagtca 20

<210> 175

<211> 21

<212> DNA

<213> Homo sapiens

<400> 175

ctatgcccag agatgaacag g 21

<210> 176

<211> 20

<212> DNA

<213> Homo sapiens

<400> 176

tccactaagg gctatgtcgc 20

<210> 177

<211> 24

<212> DNA

<213> Homo sapiens

<400> 177

gccagcttta ttgagtaaac ttcc 24

<210> 178

<211> 22

<212> DNA

<213> Homo sapiens

<400> 178

cactggagac tacaagtggt gg 22

<210> 179

<211> 20

<212> DNA

<213> Homo sapiens

<400> 179

catcccaacc atcactcagt 20

<210> 180

<211> 22

<212> DNA

<213> Homo sapiens

<400> 180

ggggactagc ttacagattt ga 22



<210> 181 

<211> 23 

<212> DNA 

<213> Homo sapiens 



<400> 181 

agactacatt ttggaaccag tgg 23



<210> 182 

<211> 23 

<212> DNA 

<213> Homo sapiens 



<400> 182 

tgaaaggata tttatagcct gga 23



<210> 183 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 183 

gaaggttttg tccctcgatc 20



<210> 184 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 184 

tgagggttgg gaagatcata 20



<210> 185 

<211> 18 

<212> DNA 

<213> Homo sapiens 



<400> 185 

ccttcatagc cacacccg 18



<210> 186 

<211> 21 

<212> DNA 

<213> Homo sapiens 



<400> 186 

cagctaactg ttgacatgcc a 21



<210> 187 

<211> 25 

<212> DNA 

<213> Homo sapiens 
<400> 187

tctttactgt gcttacaact ttcct 25 

<210> 188 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 188 

caacagtgca gtcggtatcg 20

<210> 189 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 189 

agatcagcaa gcagatag 18 

<210> 190 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 190 

cattccacat ggatagac 18 

<210> 191 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 191 

catacctatg aggtgtgcta cagg 24 

<210> 192 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 192 

gcattttctc atcatccttg c 21 

<210> 193 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 193 

ttacagccac caaggtttcc 20 



<210> 194
<211> 20
<212> DNA
<213> Homo sapiens

<400> 194
aggtgtgtgt gccaggttga 20



<210> 195 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 195

cactgttatc tcattaactg tgagg 25 

<210> 196 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 196 

tttgattttg tgtctcccaa a 21 

<210> 197 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 197 

ccccactccc acttttattt 20 

<210> 198 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 198 

ccagtcacct ttactagtcc tctg 24

<210> 199 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 199 

aggacacagc ctgcatctag 20

<210> 200 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 200 

accaggcatt gcactaaaag 20

<210> 201 
<211> 22
<212> DNA
<213> Homo sapiens

<400> 201
gatgggtcac actaacctgt ca 22



<210> 202 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 202 

acatttatat ttggacatgc aacc 24 

<210> 203 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 203 

agcatcttta atgtgtcagg ca 22 

<210> 204 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 204 

atgtgctggg ctggaaag 18 

<210> 205 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 205 

tcacattcaa aaatcggcaa 20 

<210> 206 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 206 

ctgcctgtgt ggtgtcgc 18 

<210> 207 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 207 

tgttttattt ctcagtacaa agcca 25

<210> 208

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 208 

gacctcctgt gacaccacg 19 

<210> 209 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 209 

ccaccaaatt atttatagtt ctgcg 25 

<210> 210 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 210 

gtaagattct ccactgttgc acc 23

<210> 211 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 211 

cctataatgg gctggaccaa 20

<210> 212 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 212 

actcctcatg tgaagtcacc g 21 

<210> 213 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 213 

cagtgtgcac gttttcattt 20

<210> 214 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 214 
cagcatcttc agcacttacc 20



<210> 215 

<211> 25 

<212> DNA 

<213> Homo sapiens 



<400> 215 

ctgcatttat tatgagaatc aacag 25



<210> 216 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 216 

tgctgctggg agtcagagtc 20



<210> 217 

<211> 23 

<212> DNA 

<213> Homo sapiens 



<400> 217 

cagggcactg agatacactt acc 23



<210> 218 

<211> 21 

<212> DNA 

<213> Homo sapiens 



<400> 218 

aaggatcaag ccaggcattt g 21



<210> 219 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 219 

acacatctct tctgtgcccc 20



<210> 220 
<211> 18 
<212> DNA 

<213> Homo sapiens 



<400> 220 

tgaaccctgg aggcagag 18

<210> 221

<211> 19

<212> DNA

<213> Homo sapiens

<400> 221

cattccccag tttgcagac 19 

<210> 222 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 222 

gtgctgggat tacaggtgt 19 

<210> 223 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 223 

gcagagaagt cctgttagcc 20

<210> 224 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 224 

ccatgctaga gaagcacaac 20

<210> 225 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 225 

agtgtggggc aggacctctg 20 

<210> 226 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 226 

cagacagata gccctgggtt c 21 

<210> 227 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 227 

tccctcatcc ccttgtctgt 20

<210> 228 
<211> 19 
<212> DNA 
<213> Homo sapiens
<400> 228 

agcccccctg gggataatc 19 

<210> 229 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 229 

gatgcttacc taccacggc 19 

<210> 230 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 230 

aggattccta tctgggctat g 21 

<210> 231 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 231 

tggcagacca tgctccgcct 20

<210> 232 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 232 

gagaaggccg ggaggctctg 20

<210> 233 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 233 

ctccatcaca accagatttg aggct 25 

<210> 234 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 234 

gggtgtgagc tgctgctgaa gg 22 
<210> 235 
<211> 25
<212> DNA
<213> Homo sapiens

<400> 235
agtgggaaac ctcaggtagc tcccg 25



<210> 236 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 236 

cagtttggct cagacatatg ggggc 25 

<210> 237 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 237 

cattagtagt ggggggacag 20

<210> 238 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 238 

caaagcgaca gtgagttagg g 21 

<210> 239 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 239 

ggagtagacc atgattactg 20

<210> 240 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 240 

catggtctat ttattctcg 19 

<210> 241 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 241 

cgccctggat cctcacacta ca 22 
<210> 242

<211> 21

<212> DNA

<213> Homo sapiens

<400> 242

gggcatcagg ggatgggtag a 21

<210> 243

<211> 23

<212> DNA

<213> Homo sapiens

<400> 243

gctcctatct gtgttttgaa tgg 23

<210> 244

<211> 22

<212> DNA

<213> Homo sapiens

<400> 244

ccgtggcata gataagtaaa cg 22

<210> 245

<211> 22

<212> DNA

<213> Homo sapiens

<400> 245

cttggagcgc tatgaggagg gc 22

<210> 246

<211> 23

<212> DNA

<213> Homo sapiens

<400> 246

atggcaactg accttccgtc ctg 23

<210> 247

<211> 16

<212> DNA

<213> Homo sapiens

<400> 247

ttggagtcac aggggc 16

<210> 248

<211> 17

<212> DNA

<213> Homo sapiens

<400> 248

cagcactatc cttgggg 17

<210> 249 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 249 

aacaaagctg cttagcacct g 21 

<210> 250 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 250 

gatgaggacc aactggtgac 20

<210> 251 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 251 

ttttccaata atgtgacttc 20 

<210> 252 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 252 

caatcccaac cgtaacaggc 20 

<210> 253 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 253 

cttgatctcg cccaggaac 19 

<210> 254 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 254 

gctcgctgaa ggatgaagac 20 

<210> 255 

<211> 17 

<212> DNA 

<213> Homo sapiens 
<400> 255

gaatcgcttg aacccag 17 

<210> 256 

<211> 17 

<212> DNA 

<213> Homo sapiens 

<400> 256 

ccaggtggtc ttaacgg 17

<210> 257 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 8 

<223> n = A,T,C or G 
<400> 257 

gaacgttntt catgtaggcg t 21 

<210> 258 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 258 

taatggtcgc tgtccc 16 

<210> 259 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 259 

agggaaaatg gtatgtgggg ag 22 

<210> 260 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 260 

gcagtgtgtg aaggcagg 18 

<210> 261 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 261 

agtggacaaa atgaggaaaa cagg 24 
<210> 262

<211> 23

<212> DNA

<213> Homo sapiens

<400> 262

ccaacacagt ttgctcacat gcc 23

<210> 263 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 263 

tgacatcttt gcattatggc 20

<210> 264 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 264 

agttatccca cctgataccg 20

<210> 265 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 265 

agctcttgct tctcagtcca 20

<210> 266 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 266 

caaaagttgt ttctgtgttt gttc 24 

<210> 267 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 267 

gcctctcaaa gtagttggaa cc 22 

<210> 268 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 268 
tgtgtatcca tagtgcaaaa cag 23

<210> 269 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 269 

ctcaaggcca ggcatcact 19 

<210> 270 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 270 

ggactcttcc atgccagtg 19 

<210> 271 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 271 

aatgatgatc tcaactctg 19 

<210> 272 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 272 

actgaagaac tcttgtcct 19 

<210> 273 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 273 

gacatctgtt agtctcataa ttc 23 

<210> 274 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 274 

ggtaacagtg tcttgctt 18 

<210> 275 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 275

ctatgtacaa aacaggaaga g 21 

<210> 276 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 276 

atcctagttt cctctcctt 19 

<210> 277 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 277 

gtaaatgaga aacagacaaa tga 23 

<210> 278 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 278 

ctattggatg tgatatgtta tgg 23 

<210> 279 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 279 

aagtagaaac aaaatgaggg ac 22 

<210> 280 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 280 

cctaccccaa ggtaacag 18 

<210> 281 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 281 

acttcctata aatggaggtg ag 22 



<210> 282
<211> 18
<212> DNA
<213> Homo sapiens

<400> 282
gaggagcttc aagaggaa 18



<210> 283 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 283 

catactccta gactcaagga atc 23

<210> 284 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 284 

gaatgatgta catgaattct ttg 23 

<210> 285 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 285 

gtgttgagga gaaaagcact 20

<210> 286 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 286 

ctcccagtag tcacattcc 19 

<210> 287 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 287 

caagttacaa ataacttaag ccg 23 

<210> 288 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 288 

caagacccta tctctacaaa aac 23 
<210> 289 
<211> 24
<212> DNA
<213> Homo sapiens

<400> 289
tttattagaa gtgactcttg gccc 24



<210> 290 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 290 

gactacctgc cctcagcttg 20 

<210> 291 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 291 

ttctcatgta caaagcggtc 20

<210> 292 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 292 

ccactggctt ctctcttttt 20 

<210> 293 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 293 

caccagaagg ttggggtg 18 

<210> 294 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 294 

actattacga catgaacgcg g 21 

<210> 295 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 295 

ctcatgctgg atgacccc 18 
<210> 296
<211> 24 
<212> DNA 

<213> Homo sapiens 



<400> 296 

ttgcctttct tgaaacttaa ttcc 24



<210> 297 
<211> 19 
<212> DNA 

<213> Homo sapiens 



<400> 297 

tcacagcctt cagtcaggg 19



<210> 298 

<211> 18 

<212> DNA 

<213> Homo sapiens 



<400> 298 

acatgctgtg gcaccatg 18



<210> 299 

<211> 18 

<212> DNA 

<213> Homo sapiens 



<400> 299 

cctgagctac tgccacag 18



<210> 300 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 300 

ccctgacttg gacagtgtcc 20



<210> 301 

<211> 18 

<212> DNA 

<213> Homo sapiens 



<400> 301 

tcagagtcac tcctgccc 18



<210> 302 

<211> 22 

<212> DNA 

<213> Homo sapiens 



<400> 302 
caaattcaag ctcatccaga cc 22

<210> 303 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 303 

cggcatttca tccaggac 18 

<210> 304 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 304 

ggtgtaggag gtgcgacaat 20 

<210> 305 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 305 

ttccatttat tgagcacctg 20 

<210> 306 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 306 

cttaagccac tgtgttttgg 20

<210> 307 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 307 

cctcctacac ctgcaaaagc 20 

<210> 308 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 308 

tggaagaacc ccagaggac 19 

<210> 309 

<211> 23 

<212> DNA 

<213> Homo sapiens 
<400> 309

aaagcacaaa agtaacagca aca 23

<210> 310

<211> 20

<212> DNA

<213> Homo sapiens

<400> 310

gtgtgtgggc cacaatattg 20

<210> 311

<211> 20

<212> DNA

<213> Homo sapiens

<400> 311

agagcacctt tcctcagcac 20

<210> 312

<211> 20

<212> DNA

<213> Homo sapiens

<400> 312

agaatctcat cacaggggcg 20

<210> 313

<211> 25

<212> DNA

<213> Homo sapiens

<400> 313

aaaaaggaca gtgtctaaaa tttga 25

<210> 314

<211> 25

<212> DNA

<213> Homo sapiens

<400> 314

aattgttttt gtttgttttt tgagt 25

<210> 315

<211> 22

<212> DNA

<213> Homo sapiens

<400> 315

gatttaggga gtacaagtgc gg 22

<210> 316
<211> 25
<212> DNA

<213> Homo sapiens
<400> 316 

ggggacaaat tatactttat tcagg 25 

<210> 317 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 317 

ccatcatcat attggtgtga cc 22 

<210> 318 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 318 

tggctgccca agaagaag 18 

<210> 319 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 319 

ttaagatgcc attaaactca tgac 24 

<210> 320 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 320 

ccaaggagat gaccaagtgg 20 

<210> 321 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 321 

ccatctcttt tatcagggtt gg 22 

<210> 322 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 322 

ctctgtgcaa gtaagcatct taca 24 

<210> 323 
<211> 20
<212> DNA
<213> Homo sapiens
<400> 323

cgactgtgta ttttccacag 20



<210> 324 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 324 

agaagcccat atcaatgcac 20 

<210> 325 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 325 

agcttaaagt aggacaacca tgg 23 

<210> 326 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 326 

ggatgcttca ctccagaaag 20

<210> 327 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 327 

tgttgtttat ttccacctgc c 21 

<210> 328 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 328 

agagtggctg caggccag 18 

<210> 329 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 329 

tttttttttt tacacgaatt tgagg 25 
<210> 330
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 330 

tgaggaagta aaaacaggtc atc 23

<210> 331 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 331 

atgaaatctt aagcagaatc cca 23

<210> 332 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 332 

cacagagtcc cagggtctgt 20 

<210> 333 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 333 

aaaggccttt atttatctct ctctg 25 

<210> 334 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 334 

gcctcagagc tggtgggt 18

<210> 335 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 335 

gcttctaagt cttagagtca gctgg 25

<210> 336 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 336 






agcccacagt cagcctacc 19 

<210> 337 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 337 

ttggttaaat gatgcccaga 20

<210> 338 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 338 

tggtcccact cacatccc 18 

<210> 339 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 339 

acacagcatg cagggagag 19 

<210> 340 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 340 

atccctggtg cttaggtgg 19 

<210> 341 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 341 

gatggaagta gctcctctcg g 21 

<210> 342 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 342 

ggaaggccag caagtactac c 21 

<210> 343 

<211> 18 

<212> DNA 

<213> Homo sapiens 
<400> 343

ccggtgcttg gaaagatg 18 

<210> 344 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 344 

gaagtgtctc tgttggggga 20

<210> 345 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 345 

ttacaggcat gagtcactac gc 22 

<210> 346 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 346 

accactctca cagcccttac a 21 

<210> 347 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 347 

ccctccctcc acacacac 18 

<210> 348 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 348 

gctcactgaa ctttcagggc 20

<210> 349 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 349 

agatacgggc aaaacactgg 20

<210> 350 
<211> 21 
<212> DNA 
<213> Homo sapiens
<400> 350

gttgaatata gagcagggcc c 21



<210> 351 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 351 

ttctgaggtc agggctgtct 20

<210> 352 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 352 

agcttggaaa atctcgtgtc a 21 

<210> 353 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 353 

actcagtccc tcccaccc 18 

<210> 354 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 354 

tcctctcact ccttcccaga 20 

<210> 355 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 355 

gtgatcacgg ctcaacctg 19 

<210> 356 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 356 

tggaggactg cttgagcc 18 
<210> 357 
<211> 19
<212> DNA
<213> Homo sapiens
<400> 357

ctgcagctgc ctcagtttc 19



<210> 358 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 358 

tcaaaagtgc tggtgacagc 20 

<210> 359 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 359 

atttccagag ccagctcaaa 20

<210> 360 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 360 

ctttaatgtt gtgatgacac aaagc 25 

<210> 361 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 361 

gatcatgcac tgttgaccac 20 

<210> 362 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 362 

tacatttgaa acatttaaaa cctga 25 

<210> 363 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 363 

aactgagctg taaccagact ggga 24 
<210> 364

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 364 

tggaacagtc tggtcctgat gg 22 

<210> 365 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 365 

ttatcccttt attgtttctc ctttg 25 

<210> 366 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 366 

tggtcacctg tatttattgc tagg 24 

<210> 367 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 367 

tcttcaaagc ctctgcagta cc 22 

<210> 368 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 368 

ctcatctcca acctgtctaa cc 22 

<210> 369 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 369 

gtggctgcag ctaatgtaag acac 24 

<210> 370 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 370 
cagcagagac aatggcgtaa gtcc 24

<210> 371 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 371 

ctgattgaga accagaacag 20 

<210> 372 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 372 

taaagcccta taacctctcc 20 

<210> 373 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 373 

tagtaaggga ccttcaccag 20

<210> 374 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 374 

agatgtttgg tatgacttgg 20 

<210> 375 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 375 

gatgattaaa ctctcctggc 20 

<210> 376 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 376 

gagacagcta agcactcatg 20

<210> 377 

<211> 19 

<212> DNA 

<213> Homo sapiens 
<400> 377

gaggtggtgg gcacctgta 19 

<210> 378 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 378 

agaggggagg aacacacctt 20 

<210> 379 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 379 

gaccagagtc tgcccagaag 20

<210> 380 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 380 

tccccagctc tatcccaac 19 

<210> 381 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 381 

ggagggatgg acaagtctga 20 

<210> 382 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 382 

gtccagctcg ctgactatcc 20 

<210> 383 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 383 

tcaaaacaca gtcatctcca 20

<210> 384 
<211> 20 
<212> DNA 
<213> Homo sapiens
<400> 384

gcaaaggctt taccatattg 20

<210> 385
<211> 16
<212> DNA
<213> Homo sapiens
<400> 385

gctcagcacc cccatt 16

<210> 386
<211> 16
<212> DNA
<213> Homo sapiens
<400> 386

tccctgctcg ggaaac 16

<210> 387
<211> 20
<212> DNA
<213> Homo sapiens
<400> 387

gttctccaga gagacagcac 20

<210> 388
<211> 19
<212> DNA
<213> Homo sapiens
<400> 388

gagagcaaca ctattgccc 19

<210> 389
<211> 21
<212> DNA
<213> Homo sapiens
<400> 389

tatagacttc agccctgctg c 21

<210> 390
<211> 20
<212> DNA
<213> Homo sapiens
<400> 390

cctctgtagg atgcagttgg 20

<210> 391
<211> 20
<212> DNA
<213> Homo sapiens
<400> 391

ttgctacgca ctcctctact 20



<210> 392 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 392 

gtgaaggcag gaaatgtgac 20

<210> 393 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 393 

atcctagacc agaggagccc 20 

<210> 394 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 394 

ctccccctgg tccagttatt 20

<210> 395 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 395 

aactttcatt tgccaaggga 20 

<210> 396 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 396 

agcagatctg ctcttgcgat 20

<210> 397 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 397 

acagttgtca tcggtaggca 20 
<210> 398
<211> 22
<212> DNA
<213> Homo sapiens
<400> 398

aaaagtatga atgggatgga gc 22

<210> 399
<211> 20
<212> DNA
<213> Homo sapiens
<400> 399

gtgcaggtgg cgtttatttt 20

<210> 400
<211> 21
<212> DNA
<213> Homo sapiens
<400> 400

ccctatatct ccgtgtgctc c 21

<210> 401
<211> 21
<212> DNA
<213> Homo sapiens
<400> 401

gctctagtgg gaaacctcag g 21

<210> 402
<211> 20
<212> DNA
<213> Homo sapiens
<400> 402

gaattccagg ctcttgcttg 20

<210> 403
<211> 20
<212> DNA
<213> Homo sapiens
<400> 403

ggtttggtct caaaggcaaa 20

<210> 404
<211> 20
<212> DNA
<213> Homo sapiens
<400> 404

ccagtacatg gtggtcacca 20

<210> 405 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 405 

gctgccttgg aatttctgtt 20 

<210> 406 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 406 

gtgctgtggt ggggaaag 18 

<210> 407 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 407 

attcaagctc atccagaccc 20 

<210> 408 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 408 

ggactggccc tttgaaactc 20 

<210> 409 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 409 

atattgaccg tgcacaaata cg 22

<210> 410 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 410 

agacctggga aaagtggaga a 21 

<210> 411 

<211> 20 

<212> DNA 

<213> Homo sapiens 






<400> 411 

attggcagtg gaaaatgctt 20 

<210> 412 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 412 

ttaatctttt gtcaacttcc tgatt 25 

<210> 413 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 413 

tctgtcctcc tttcaccgga agc 23

<210> 414 

<211> 29 

<212> DNA 

<213> Homo sapiens 

<400> 414 

ggataaagaa actccgctct gctggtaga 29 

<210> 415 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 415 

tcagggcctg tgttgccgca ctctg 25

<210> 416 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 416 

agcgatgtaa agggtaccag tgccg 25 

<210> 417 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 417 

aggcatgcaa gcttctta 18

<210> 418 
<211> 18 
<212> DNA 
<213> Homo sapiens
<400> 418

ccgggaggag acatctat 18



<210> 419 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 419 

tggtaagcac agaaaatgc 19 

<210> 420 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 420 

aatggatggg ggattatt 18 

<210> 421 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 421 

ctggacgtta tgtctgcc 18 

<210> 422 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 422 

agaggcccag tcacagat 18 

<210> 423 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 423 

atcactctga actgccact 19 

<210> 424 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 424 

cccttctgtt tttctgtttt 20 

<210> 425 
<211> 18
<212> DNA
<213> Homo sapiens
<400> 425

caagctttga aggaagag 18



<210> 426 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 426 

taggacgtta agtgaggac 19 

<210> 427 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 427 

gctctgcagt gggtaaaa 18 

<210> 428 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 428 

actctccaag actgtgcg 18 

<210> 429 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 429 

ccctttctga ggcaagat 18 

<210> 430

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 430 

gaccacctgg gagagaac 18 

<210> 431 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 431 

cgctatgagt cccatctg 18 
<210> 432

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 432 

gatcagctgc aatgaagg 18 

<210> 433 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 433 

ttgagtacac ggggtgac 18 

<210> 434 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 434 

cgcaggactg aaagatga 18 

<210> 435 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 435 

acctgtctcc tctcctgg 18 

<210> 436 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 436 

tgcttttctt ctgtggga 18 

<210> 437 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 437 

atgaccagca agcattgt 18 

<210> 438 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 438 
gtactgggat tacaggcg 18

<210> 439 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 439 

gcagaaggtc ctttggat 18 

<210> 440 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 440 

tttgcaggat tcatgctt 18 

<210> 441 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 441 

cgacattctt ttctggagg 19 

<210> 442 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 442 

acctttgcat gttggtttt 19 

<210> 443 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 443 

gcacttttcc ttccttcc 18

<210> 444 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 444 

tgctttgctt tcttctgg 18 

<210> 445 

<211> 19 

<212> DNA 

<213> Homo sapiens 
<400> 445

acagctccag agagaagga 19



<210> 446 

<211> 19 

<212> DNA 

<213> Homo sapiens 



<400> 446 

gcagtcactt gaaaccaga 19



<210> 447 

<211> 18 

<212> DNA 

<213> Homo sapiens 



<400> 447 

aggcatcaag ctttcctt 18



<210> 448 

<211> 18 

<212> DNA 

<213> Homo sapiens 



<400> 448 

ggtttagaga accgagcc 18



<210> 449 

<211> 18 

<212> DNA 

<213> Homo sapiens 



<400> 449 

gtggtgctgc aagttacc 18



<210> 450 

<211> 18 

<212> DNA 

<213> Homo sapiens 



<400> 450 

ggaatccctt tctttcca 18



<210> 451 

<211> 18 

<212> DNA 

<213> Homo sapiens 



<400> 451 

gaccatttgt tacgcagc 18



<210> 452 
<211> 19 
<212> DNA 
<213> Homo sapiens
<400> 452

gatgggtgtg aatgaacaa 19



<210> 453 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 453 

ctcaagcttc tgttcatgc 19 

<210> 454 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 454 

gctgtgagtg tcttggct 18 

<210> 455 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 455 

tacagaaaac cgcagctc 18 

<210> 456 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 456 

gccaccaaag gaaagatt 18 

<210> 457 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 457 

aaaaggaggg aatcatgg 18 

<210> 458 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 458 

tcacttagca ggaggcag 18 
<210> 459 
<211> 18
<212> DNA
<213> Homo sapiens
<400> 459

ctgagcatcc gatgagac 18



<210> 460 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 460 

gtgcaaaatg agcagctt 18 

<210> 461 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 461 

tctaacccct tactgggc 18 

<210> 462 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 462 

tcctcaaact gggaatga 18 

<210> 463 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 463 

tttacacagg accaggga 18 

<210> 464 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 464 

atctccccca ctcagaag 18 

<210> 465 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 465 

gtccacgggc tttattct 18 
<210> 466
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 466 

tgagcataaa tttcattagc tg 22 

<210> 467 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 467 

ggaagagcaa aataaatcca 20 

<210> 468 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 468 

ggtgcacaga attgttcat 19 

<210> 469 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 469 

agcacgctta tttcatgg 18 

<210> 470 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 470 

gtaacaccag cagggaca 18 

<210> 471 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 471 

tcctgctgca ttatggat 18 

<210> 472 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 472 
gggggtgaga agtaggaa 18

<210> 473 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 473 

atggggatta aatacggg 18 

<210> 474 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 474 

agctagcatt gggctctt 18 

<210> 475 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 475 

ctgaggagaa gaggctgg 18 

<210> 476 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 476 

cgccttacaa ggcaagta 18 

<210> 477 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 477 

aggatgcttg ctagggtt 18 

<210> 478 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 478 

cacaagtgtc tggaaggc 18 

<210> 479 

<211> 18 

<212> DNA 

<213> Homo sapiens 
<400> 479

ggtctcagga gcccttta 18 

<210> 480 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 480 

acatgccact cttctcacta a 21 

<210> 481 

<211> 18 

<212> DNA

<213> Homo sapiens 

<400> 481 

acttaaccaa ggatgggg 18 

<210> 482 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 482 

caacccacga gcataaga 18 

<210> 483 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 483 

taggctctgc actcttgg 18 

<210> 484 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 484 

acccacggag tctctctc 18 

<210> 485 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 485 

taaaggcggt gaagtgag 18 



<210> 486 
<211> 18 
<212> DNA 
<213> Homo sapiens
<400> 486

ctaccgctct cctaggct 18



<210> 487 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 487 

tggggccaga taattctt 18 

<210> 488 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 488 

ctggtgtttg gtggtgtt 18 

<210> 489 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 489 

aaggaagagg tcaccagg 18 

<210> 490 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 490 

cacaaattcc atttccca 18 

<210> 491 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 491 

tcaataggtg atccaacatt t 21 

<210> 492 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 492 

aaagtcccac aaagggtc 18 
<210> 493 
<211> 18

<212> DNA 

<213> Homo sapiens 

<400> 493 

gggtaggggg atcttttt 18 

<210> 494 

<211> 18 

<212> DNA

<213> Homo sapiens 

<400> 494 

tgtggaacat tcattggc 18 

<210> 495 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 495 

gtcctgggaa agatggaa 18 

<210> 496 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 496 

tcaaagcgtc tcccataa 18 

<210> 497 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 497 

tctttcgctg tacttggc 18 

<210> 498 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 498 

tgggaggtca gagtgatg 18

<210> 499 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 499 

ggacagtgta tgtgttggg 19 
<210> 500

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 500 

aggcagctgt ttttgtga 18 

<210> 501 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 501 

cttcttgagt cccgtgtg 18 

<210> 502 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 502 

caaccgagaa tcctctagc 19 

<210> 503 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 503 

gctgggagag aatcacaa 18 

<210> 504 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 504 

gctttgcaga agagacca 18 

<210> 505 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 505 

acgctgtcag gtcacact 18 

<210> 506 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 506 
ggaggatgct caggtgat 18

<210> 507 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 507 

tagggggatc tttttcca 18 

<210> 508 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 508 

gagcaatttg aaaagcca 18 

<210> 509 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 509 

atggtccagc tcctctgt 18 

<210> 510 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 510 

atagagcacc ccatctcc 18 

<210> 511 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 511 

aacattgctg ttagccca 18 

<210> 512 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 512 

gcaatcgaaa cagcattc 18 

<210> 513 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 513

atgagttggc agctgaag 18 

<210> 514 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 514 

aatgaaggtc ttgcctcc 18

<210> 515 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 515 

gaggagaaga tccacaagcg 20

<210> 516 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 516 

tctctggggc atactgaacc 20 

<210> 517 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 517 

ctgagctttt ggcactgt 18 

<210> 518 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 518 

ctgctaggtg acagcagg 18 

<210> 519 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 519 

tgtatgagtc tggagggtgt 20 



<210> 520 
<211> 18 
<212> DNA 
<213> Homo sapiens
<400> 520

acacctggct gaggaaat 18



<210> 521 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 521 

gcaggggacg tgataata 18 

<210> 522 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 522 

ttttgcttcc taccatgc 18 

<210> 523 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 523 

aaaattgtga gcacctcc 18 

<210> 524 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 524 

tttatattta aagtggcttt gtt 23 

<210> 525 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 525 

gtgcaaagcc cacagtat 18 

<210> 526 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 526 

aggaaaatgc aagagcag 18 
<210> 527 
<211> 20
<212> DNA
<213> Homo sapiens
<400> 527

ccactgaatt gcatactttg 20

<210> 528
<211> 18
<212> DNA
<213> Homo sapiens
<400> 528

tctgggtcca gtctgcta 18

<210> 529
<211> 18
<212> DNA
<213> Homo sapiens
<400> 529

agattttggg gagtcagg 18

<210> 530
<211> 17
<212> DNA
<213> Homo sapiens
<400> 530

gcgctcaagc aattctc 17

<210> 531
<211> 18
<212> DNA
<213> Homo sapiens
<400> 531

caagccccaa agtagtca 18

<210> 532
<211> 18
<212> DNA
<213> Homo sapiens
<400> 532

gaatcatcca atccacga 18

<210> 533
<211> 18
<212> DNA
<213> Homo sapiens
<400> 533

agcctccagg tgactacc 18

<210> 534

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 534 

gaaggacatg gtcagcag 18 

<210> 535 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 535 

atgctttcag cattttcg 18 

<210> 536 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 536 

tgatccgtgg tagggtta 18 

<210> 537 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 537 

gtcggattgg tttcacaa 18 

<210> 538 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 538 

ttttatggga atttcagcc 19 

<210> 539 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 539 

tttggaaaag aacagaaatg t 21 

<210> 540 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 540 
ggctagtctt tcctgaacc 19

<210> 541
<211> 18
<212> DNA
<213> Homo sapiens
<400> 541

ccttaatgcc cctgattc 18

<210> 542
<211> 18
<212> DNA
<213> Homo sapiens
<400> 542

gcgtttacaa gctgagga 18

<210> 543
<211> 18
<212> DNA
<213> Homo sapiens
<400> 543

tcaagcttgc tttctcaa 18

<210> 544
<211> 18
<212> DNA
<213> Homo sapiens
<400> 544

gtagcccagc aagtgtct 18

<210> 545
<211> 18
<212> DNA
<213> Homo sapiens
<400> 545

cctggctgga gataggat 18

<210> 546
<211> 18
<212> DNA
<213> Homo sapiens
<400> 546

cttcccctct gcctatgt 18

<210> 547
<211> 18
<212> DNA
<213> Homo sapiens
<400> 547

ggcacgtact tcctacca 18

<210> 548
<211> 18
<212> DNA
<213> Homo sapiens
<400> 548

ggtgcttctt acaggcaa 18

<210> 549
<211> 16
<212> DNA
<213> Homo sapiens
<400> 549

acccaggctg gtgtgt 16

<210> 550
<211> 23
<212> DNA
<213> Homo sapiens
<400> 550

actgagttaa ttatcactcc cct 23

<210> 551
<211> 18
<212> DNA
<213> Homo sapiens
<400> 551

gatgcatttt gcttcacc 18

<210> 552
<211> 21
<212> DNA
<213> Homo sapiens
<400> 552

tctgctttta gagctgttag c 21

<210> 553
<211> 19
<212> DNA
<213> Homo sapiens
<400> 553

tcaagcttca aagagcaga 19

<210> 554
<211> 18
<212> DNA
<213> Homo sapiens
<400> 554

ggagtacatc ccaggacc 18



<210> 555 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 555 

tggtgctttt aaatccaga 19 

<210> 556 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 556 

ctcccttact tacttgcatt g 21 

<210> 557 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 557 

tcttctccca gggaatct 18 

<210> 558 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 558 

tttatgtccc ctgagcac 18 

<210> 559 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 559 

tccctggcta tcttgaatc 19 

<210> 560 
<211> 16 
<212> DNA 

<213> Homo sapiens 
<400> 560 

cttgactggg tccacg 16 
<210> 561 
<211> 20
<212> DNA
<213> Homo sapiens
<400> 561

cgagacgcca gtagatacca 20



<210> 562 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 562 

catcctccat gcctttcagt 20 

<210> 563 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 563 

agttccagag aacgagacgc 20 

<210> 564 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 564 

cttgtcatcc tccatgcctt 20 

<210> 565 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 565 

gagcgtgaga ggttgaggag 20

<210> 566 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 566 

aaacaaactc cagacgcacc 20 

<210> 567 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 567 

ctgaaccact acctgtatga cctg 24 
<210> 568
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 568 

ctaactactt actcctacag ggccc 25 

<210> 569
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 569 

gaagcatttc aatactttaa ctg 23 

<210> 570 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 570 

ccactccagt gcacccaatc 20 

<210> 571 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 571 

cttctcctgg ccactctgac 20 

<210> 572 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 572 

ggtttacctt tgaatcccag c 21 

<210> 573 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 573 

tgaggatgaa tgagcacata gg 22 

<210> 574 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 574 
tttgtggtcc attgagtagg c 21

<210> 575
<211> 20
<212> DNA
<213> Homo sapiens
<400> 575

aggggaagga atgtgcttgg 20

<210> 576
<211> 20
<212> DNA
<213> Homo sapiens
<400> 576

ttcggctgag cgggcagtgt 20

<210> 577
<211> 26
<212> DNA
<213> Homo sapiens
<400> 577

attgaaggtc ctccaaaaga atgctg 26

<210> 578
<211> 30
<212> DNA
<213> Homo sapiens
<400> 578

agaacgtcaa catatctttt tgggggacac 30

<210> 579
<211> 22
<212> DNA
<213> Homo sapiens
<400> 579

ttgtatttga ggactttgct cg 22

<210> 580
<211> 20
<212> DNA
<213> Homo sapiens
<400> 580

cggtaccatc ctcctcttcc 20

<210> 581
<211> 20
<212> DNA
<213> Homo sapiens
<400> 581

tttttgcctc atctatgccc 20

<210> 582
<211> 20
<212> DNA
<213> Homo sapiens
<400> 582

gggtgacaga gcaagactcc 20

<210> 583
<211> 18
<212> DNA
<213> Homo sapiens
<400> 583

ttgctcaagt tctcctgg 18

<210> 584
<211> 18
<212> DNA
<213> Homo sapiens
<400> 584

accttgtttt gaggggag 18

<210> 585
<211> 18
<212> DNA
<213> Homo sapiens
<400> 585

cttggctatt tggacagc 18

<210> 586
<211> 18
<212> DNA
<213> Homo sapiens
<400> 586

gggcatttac tcacttgc 18

<210> 587
<211> 19
<212> DNA
<213> Homo sapiens
<400> 587

cttgtgtcag ttgtcaggg 19

<210> 588
<211> 19
<212> DNA
<213> Homo sapiens
<400> 588 

tggaattgtt gtgtcttgg 19 

<210> 589 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 589 

ccagttccac tggatgtt 18 

<210> 590 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 590 

atgggctgtg tttctcaa 18 

<210> 591 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 591 

ctgcctatcc ctggactt 18 

<210> 592 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 592 

agtttgtccc tagtgccc 18 

<210> 593 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 593 

caacacgtct gacatccat 19 

<210> 594 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 594 

ggatagtgca caccca 16 
<210> 595 
<211> 22

<212> DNA 

<213> Homo sapiens 

<400> 595 

tgggtggtac tattgttccc at 22 

<210> 596 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 596 

agttccagcc cccttaccag 20 

<210> 597 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 597 

ggccactatc atccctgtgt 20

<210> 598 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 598 

tttcacatgg gaagaacacg 20

<210> 599 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 599 

acagtgacac tagggacggg 20

<210> 600 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 600 

tgccaggatg gagataacaa 20

<210> 601 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 601 

cctgtggcac acatatcacc 20 
<210> 602

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 602 

acaaccaaga atggagccac 20

<210> 603 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 603 

tgctgtgtaa caagtcccca 20

<210> 604 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 604 

tgaacggagg acctaccaag 20

<210> 605 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 605 

gcagggtccg actcactaag 20

<210> 606 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 606 

gctgtgagtt ccctttacgc 20

<210> 607 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 607 

acagtgggga caaagacagg 20

<210> 608 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 608 
tacagggcac ctcccagtag 20

<210> 609
<211> 21
<212> DNA
<213> Homo sapiens
<400> 609

tcttctgtta aggtttcccc c 21

<210> 610
<211> 20
<212> DNA
<213> Homo sapiens
<400> 610

tgtctcaaac ctccctctgc 20

<210> 611
<211> 20
<212> DNA
<213> Homo sapiens
<400> 611

aacatatttc ctccccagcc 20

<210> 612
<211> 19
<212> DNA
<213> Homo sapiens
<400> 612

cagtcccagc caatgagaa 19

<210> 613
<211> 20
<212> DNA
<213> Homo sapiens
<400> 613

ctcctctgca tgggagaatc 20

<210> 614
<211> 20
<212> DNA
<213> Homo sapiens
<400> 614

agacctggga ccagtctgtg 20

<210> 615
<211> 20
<212> DNA
<213> Homo sapiens
<400> 615

gggagacgac gtcacaagat 20 

<210> 616 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 616 

tgatgttggg aagatggtga 20 

<210> 617 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 617 

caggcatctt ctatgtgcca 20 



<210> 618 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 618 

gggaggcaca agttctttca 20

<210> 619 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 619 

acttcgtggc actgagtgtg 20 

<210> 620 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 620 

cctttcttac ggatgaggca 20 

<210> 621 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 621 

ggctgctgag ctcttctgat 20

<210> 622 
<211> 20 
<212> DNA 
<213> Homo sapiens
<400> 622 

tgggtctctc tgcctgactt 20 

<210> 623 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 623 

tcacctactt ccagcttccg 20 

<210> 624 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 624 

agacctggga ccagtctgtg 20

<210> 625 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 625 

ctcctctgca tgggagaatc 20 

<210> 626 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 626 

aattcaggag acctgggacc 20

<210> 627 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a BstXI-linker adapter. 
<400> 627 

gtcttcacca cgggg 15 

<210> 628 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> Artificial Sequence is a BstXI-linker adapter.
<400> 628 

gtggtgaaga c 11 

<210> 629 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 
<400> 629 

ccaagttctg agaagtcc 18 

<210> 630 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 
<400> 630 

aatacctgaa accatac 17 

<210> 631 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is an allele specific oligonucleotide. 
<400> 631 

agactggggt gagacgc 17 

<210> 632 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is an allele specific oligonucleotide. 
<400> 632 

cagactgggt tgagacgcc 19 

<210> 633 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Artificial Sequence is a primer. 
<400> 633 

cccgtgtgct ccgccgccca gttc 24 

<210> 634 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 
<400> 634 

ggctcacgga gctcatcatg gactt 25 

<210> 635 
<211> 502 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 



<400> 635 

cccgtgtgct ccgccgccca gttcccctgc gcgcggggtc agtgtgtgga cctgcgcctg 60 
cgctgcgacg gcgaggcaga ctgtcaggac cgctcagacg aggtggactg tgacgccatc 120
tgcctgccca accagttccg gtgtgcgagc ggccagtgtg tcctcatcaa acagcagtgc 180 
gactccttcc ccgactgtat cgacggctcc gacgagctca tgtgtgaaat caccaagccg 240
ccctcagacg acagcccggc ccacagcagt gccatcgggc ccgtcattgg catcatcctc 300 
tctctcttcg tcatgggtgg tgtctatttt gtgtgccagc gcgtggtgtg ccagcgctat 360
gcgggggcca acgggccctt cccgcacgag tatgtcagcg ggaccccgca cgtgcccctc 420 
aatttcatag ccccgggcgg ttcccagcat ggccccttca caggcatcgc atgcggaaag 480 
tccatgatga gctccgtgag cc 502 

<210> 636 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 
<400> 636 

agcgaggcca ccatccacag g 21 

<210> 637 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 
<400> 637

tcgctggtcg gcataatcaa t 21 

<210> 638 

<211> 501 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 



<400> 638 

agcagagcca ccatccacag gatctccctg gagactaaca acaacgatgt ggctatccca 60 
ctcacgggtg tcaaagaggc ctctgcactg gactttgatg tgtccaacaa tcacatctac 120
tggactgatg ttagcctcaa gacgatcagc cgagccttca tgaatgggag ctcagtggag 180
cacgtgattg agtttggcct cgactaccct gaaggaatgg ctgtggactg gatgggcaag 240 
aacctctatt gggcggacac agggaccaac aggattgagg tggcccggct ggatgggcag 300 
ttccggcagg tgcttgtgtg gagagacctt gacaacccca ggtctctggc tctggatcct 360
actaaaggct acatctactg gactgagtgg ggtggcaagc caaggattgt gcgggccttc 420 
atggatggga ccaattgtat gacactggta gacaaggtgg gccgggccaa cgacctcacc 480 
attgattatg ccgaccagcg a 501 

<210> 639 
<211> 26 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a Zmax1 oligonucleotide.
<400> 639 

raguacagcu ucuugccaac ccaguc 26 

<210> 640 
<211> 26 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a Zmax1 oligonucleotide.
<400> 640 

ruccuccagg ucgaugguca gcccau 26 

<210> 641 
<211> 26 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a Zmax1 oligonucleotide.
<400> 641

rgucugaguc cgaguucaaa uccagg 26 

<210> 642 

<211> 3081 

<212> DNA 

<213> Homo sapiens 

<400> 642 

cccgccagcc cagcccagcc caaccctact ccctccccac gccagggcag cagccgttgc 60 

tcagagagaa ggtggaggaa gaaatccaga ccctagcacg cgcgcaccat catggaccat 120 

tatgattctc agcaaaccaa cgattacatg cagccagaag aggactggga ccgggacctg 180 

ctcctggacc cggcctggga gaagcagcag agaaagacat tcacggcatg gtgtaactcc 240 

cacctccgga aggcggggac acagatcgag aacatcgaag aggacttccg ggatggcctg 300 

aagctcatgc tgctgctgga ggtcatctca ggtgaacgct tggccaagcc agagcgaggc 360 

aagatgagag tgcacaagat ctccaacgtc aacaaggccc tggatttcat agccagcaaa 420 

ggcgtcaaac tggtgtccat cggagccgaa gaaatcgtgg atgggaatgt gaagatgacc 480 

ctgggcatga tctggaccat catcctgcgc tttgccatcc aggacatctc cgtggaagag 540

acttcagcca aggaagggct gctcctgtgg tgtcagagaa agacagcccc ttacaaaaat 600 

gtcaacatcc agaacttcca cataagctgg aaggatggcc tcggcttctg tgctttgatc 660 

caccgacacc ggcccgagct gattgactac gggaagctgc ggaaggatga tccactcaca 720 

aatctgaata cggcttttga cgtggcagag aagtacctgg acatccccaa gatgctggat 780 

gccgaagaca tcgttggaac tgcccgaccg gatgagaaag ccatcatgac ttacgtgtct 840

agcttctacc acgccttctc tggagcccag aaggcggaga cagcagccaa tcgcatctgc 900 

aaggtgttgg ccgtcaacca ggagaacgag cagcttatgg aagactacga gaagctggcc 960 

agtgatctgt tggagtggat ccgccgcaca atcccgtggc tggagaaccg ggtgcccgag 1020

aacaccatgc atgccatgca acagaagctg gaggacttcc gggactaccg gcgcctgcac 1080 

aagccgccca aggtgcagga gaagtgccag ctggagatca acttcaacac gctgcagacc 1140 

aagctgcggc tcagcaaccg gcctgccttc atgccctctg agggcaggat ggtctcggac 1200 

atcaacaatg cctggggctg cctggagcag gtggagaagg gctatgagga gtggttgctg 1260

aatgagatcc ggaggctgga gcgactggac cacctggcag agaagttccg gcagaaggcc 1320

tccatccacg aggcctggac tgacggcaaa gaggccatgc tgcgacagaa ggactatgag 1380

accgccaccc tctcggagat caaggccttg ctcaagaagc atgaggcctt cgagagtgac 1440 

ctggctgccc accaggaccg tgtggagcag attgccgcca tcgcacagga gctcaatgag 1500 

ctggactatt atgactcacc cagtgtcaac gcccgttgcc aaaagatctg tgaccagtgg 1560 

gacaatctgg gggccctaac tcagaagcga agggaagctc tggagcggac cgagaaactg 1620

ctggagacca ttgaccagct gtacttggag tatgccaagc gggctgcacc cttcaacaac 1680 

tggatggagg gggccatgga ggacctgcag gacaccttca ttgtgcacac cattgaggag 1740 

atccagggac tgaccacagc ccatgagcag ttcaaggcca ccctccctga tgccgacaag 1800 

gagcgcctgg ccatcctggg catccacaat gaggtgtcca agattgtcca gacctaccac 1860 

gtcaatatgg cgggcaccaa cccctacaca accatcacgc ctcaggagat caatggcaaa 1920 

tgggaccacg tgcggcagct ggtgcctcgg agggaccaag ctctgacgga ggagcatgcc 1980 

cgacagcagc acaatgagag gctacgcaag cagtttggag cccaggccaa tgtcatcggg 2040 

ccctggatcc agaccaagat ggaggagatc gggaggatct ccattgagat gcatgggacc 2100 

ctggaggacc agctcagcca cctgcggcag tatgagaaga gcatcgtcaa ctacaagcca 2160 

aagattgatc agctggaggg cgaccaccag ctcatccagg aggcgctcat cttcgacaac 2220 

aagcacacca actacaccat ggagcacatc cgtgtgggct gggagcagct gctcaccacc 2280 

atcgccagga ccatcaatga ggtagagaac cagatcctga cccgggatgc caagggcatc 2340 

agccaggagc agatgaatga gttccgggcc tccttcaacc actttgaccg ggatcactcc 2400 

ggcacactgg gtcccgagga gttcaaagcc tgcctcatca gcttgggtta tgatattggc 2460 

aacgaccccc agggagaagc agaatttgcc cgcatcatga gcattgtgga ccccaaccgc 2520 

ctgggggtag tgacattcca ggccttcatt gacttcatgt cccgcgagac agccgacaca 2580 

gatacagcag accaagtcat ggcttccttc aagatcctgg ctggggacaa gaactacatt 2640 

accatggacg agctgcgccg cgagctgcca cccgaccagg ctgagtactg catcgcgcgg 2700 
atggccccct acaccggccc cgactccgtg ccaggtgctc tggactacat gtccttctcc 2760

acggcgctgt acggcgagag tgacctctaa tccaccccgc ccggccgccc tcgtcttgtg 2820

cgccgtgccc acagatgtga aatgaatgta atctaataga agcctaatca gcccaccatg 2880 

ttctccactg aaaaatcctc tttctttggg gtttttcttt ctttcttttt tgattttgca 2940 

ctggacggtg acgtcagcct gtacaggctc ccaggggtgg cgtcaaatgc tattgaaatt 3000 

gcgctgaatc gtatgctttt tccttttgat aaataaacaa tgtaaaaatg tttcaaaaac 3060

ctaataaaat aaataaatac g 3081 

<210> 643 

<211> 1324 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(1324)
<223> n = A,T,C or G 



<400> 643 

ggccgcccgg cgcccccagc agnccgagcc ggggcgcaca gncggggcgc agcccgcgcc 60 

ccccgccgcg attgacatga tgtttccaca aagcaggcat tcgggctcct cgcacctacc 120 

ccagcaactc aaattcacca cctcggactc ctgcgaccgc atcaaagacg aatttcagct 180 

actgcaagct cagtaccaca gcctcaagct cgaatgtgac aagttggcca gtgagaagtc 240 

agagatgcag cgtcactatg tgatgtacta cgagatgtcc tacggcttga acatcgagat 300 

gcacaaacag gctgagatcg tcaaaaggct gaacgggatt tgtgcccagg tcctgcccta 360 

cctctcccaa gagcaccagc agcaggtctt gggagccatt gagagggcca agcaggtcac 420 

cgctcccgag ctgaactcta tcatccgaca gcagctccaa gcccaccagc tgtcccagct 480 

gcaggccctg gccctgccct tgaccccact acccgtgggg ctgcagccgc cttcgctgcc 540

ggcggtcagc gcaggcaccg gcctcctctc gctgtccgcg ctgggttccc aggcccacct 600 

ctccaaggaa gacaagaacg ggcacgatgg tgacacccac caggaggatg atggcgagaa 660 

gtcggattag cagggggccg ggacagggag gttgggaggg gggacagagg ggagacagag 720

gcacggagag aaaggaatgt ttagcacaag acacagcgga gctcgggatt ggctaatctc 780 

ccatagtatt tatggtggcg ccggcggggc cccagcccag cttgcaggcc acctctagct 840 

ttcttcctac cccattccgg cttccctcct cctcccctgc agcctggtta ggtggatacc 900 

tgccctgaca tgtgaggcaa gctaaggcct ggagggtcag atgggagacc aggtcccaag 960

ggagcaagac ctgcgaagcg cagcagcccc ggcccttccc ccgttttgaa catgtgtaac 1020

cgacagtctg ccctgggcca cagccctctc accctggtac tgcatgcacg caatgctagc 1080 

tgcccctttc ccgtcctggg caccccgagt ctcccccgac cccgggtccc aggtatgctc 1140

ccacctccac ctgccccact caccacctct gctagttcca gacacctcca cgcccacctg 1200 

gtcctctccc atcgcccaca aaaggggggg cacgagggac gagcttagct gagctgggag 1260 

gagcagggtg agggtgggcg acccaggatt ccccctcccc ttcccaaata aagatgaggg 1320 

tact 1324 

<210> 644 

<211> 2377 

<212> DNA 

<213> Homo sapiens 



<400> 644 

ggtgacaaag agccaacaga gacaatagga gacttgtcaa tttgtcttga tgggctacag 60 

ttagagtctg aagttgttac caatggtgaa actacatgtt cagaaagtgc ttctcagaat 120

gatgatggct ccagatccaa ggatgaaaca agagtgagca caaatggatc agatgaccct 180 

gaagatgcag gagctggtga aaataggaga gtcagtggga ataattctcc atcactctca 240 

aatggtggtt ttaaaccttc tagacctcca agaccttcac gaccaccacc acccacccca 300 
cgtagaccag catctgtcaa tggttcacca tctgccactt ctgaaagtga tgggtctagt 360

acaggctctc tgccgccgac aaatacaaat acaaatacat ctgaaggagc aacatctgga 420 

ttaataattc ctcttactat atctggaggc tcaggcccta ggccattaaa tcctgtaact 480 

caagctccct tgccacctgg ttgggagcag agagtggacc agcacgggcg agtttactat 540

gtagatcatg ttgagaaaag aacaacatgg gatagaccag aacctctacc tcctggctgg 600 

gaacggcggg ttgacaacat gggacgtatt tattatgttg accatttcac aagaacaaca 660

acgtggcaga ggccaacact ggaatccgtc cggaactatg aacaatggca gctacagcgt 720

agtcagcttc aaggagcaat gcagcagttt aaccagagat tcatttatgg gaatcaagat 780 

ttatttgcta catcacaaag taaagaattt gatcctcttg gtccattgcc acctggatgg 840

gagaagagaa cagacagcaa tggcagagta tatttcgtca accacaacac acgaattaca 900

caatgggaag accccagaag tcaaggtcaa ttaaatgaaa agcccttacc tgaaggttgg 960 

gaaatgagat tcacagtgga tggaattcca tattttgtgg accacaatag aagaactacc 1020 

acctatatag atccccgcac aggaaaatct gccctagaca atggacctca gatagcctat 1080

gttcgggact tcaaagcaaa ggttcagtat ttccggttct ggtgtcagca actggccatg 1140

ccacagcaca taaagattac agtgacaaga aaaacattgt ttgaggattc ctttcaacag 1200 

ataatgagct tcagtcccca agatctgcga agacgtttgt gggtgatttt tccaggagaa 1260

gaaggtttag attatggagg tgtagcaaga gaatggttct ttcttttgtc acatgaagtg 1320

ttgaacccaa tgtattgcct gtttgaatat gcagggaagg ataactactg cttgcagata 1380

aaccccgctt cttacatcaa tccagatcac ctgaaatatt ttcgttttat tggcagattt 1440 

attgccatgg ctctgttcca tgggaaattc atagacacgg gtttttcttt accattctat 1500

aagcgtatct tgaacaaacc agttggactc aaggatttag aatctattga tccagaattt 1560

tacaattctc tcatctgggt taaggaaaac aatattgagg aatgtgattt ggaaatgtac 1620 

ttctccgttg acaaagaaat tctaggtgaa attaagagtc atgatctgaa acctaatggt 1680 

ggcaatattc ttgtaacaga agaaaataaa gaggaataca tcagaatggt agctgagtgg 1740 

aggttgtctc gaggtgttga agaacagaca caagctttct ttgaaggctt taatgaaatt 1800 

cttccccagc aatatttgca atactttgat gcaaaggaat tagaggtcct tttatgtgga 1860

atgcaagaga ttgatttgaa tgactggcaa agacatgcca tctaccgtca ttatgcaagg 1920

accagcaaac aaatcatgtg gttttggcag tttgttaaag aaattgataa tgagaagaga 1980 

atgagacttc tgcagtttgt tactggaacc tgccgattgc cagtaggagg atttgctgat 2040

ctcatgggga gcaatggacc acagaaattc tgcattgaaa aagttgggaa agaaaattgg 2100 

ctacccagaa gtcatacctg ttttaatcgc ctggacctgc caccatacaa gagctatgag 2160

caactgaagg aaaagctgtt gtttgccata gaagaaacag aaggatttgg acaagagtaa 2220

cttctgagaa cttgcaccat gaatgggcaa gaacttattt gcaatgtttg tccttctctg 2280 

cctgttgcac atcttgtaaa attggacaat ggctctttag agagttatct gagtgtaagt 2340 

aaattaatgt tctcatttaa aaaaaaaaaa aaaaaaa 2377

<210> 645 
<211> 1295 
<212> DNA 

<213> Homo sapiens 
<400> 645 

gggccgccgc ccacccgggc cttgcctcta cctcagtcgt tgccccccga ttttcggctg 60 

gagcccacgg ccccggccct cagcccccgc tctagcttcg ccagtagctc ggccagcgac 120 

gcgagcaagc cgtccagccc ccggggcagc ctgctgctgg acggggcggg ggctggcgga 180

gctggaggta gccggccctg cagcaatcgc accagcggca tcagcatggg ctacgaccag 240 

cgccacggga gccccttgcc agcggggccg tgcctgtttg gcccacccct ggccggagca 300

ccggcaggct attctcccgg aggggtcccg tccgcctacc cggagctcca cgccgccctg 360 

gaccgattgt acgctcagcg gcccgcgggg ttcggctgcc aggaaagccg ccactcgtat 420

cccccggccc tgggcagccc tggagctcta gccggggccc gagtgggagc ggcggggccc 480

ttggagagac ggggggcgca acccggacga cactctgtga ccggctacgg ggactgcgcc 540

gtgggcgccc ggtaccagga cgagctaaca gctttgcttc gcctgacggt gggcaccggt 600

gggcgagaag ccggagcccg cggagaaccc tcggggattg agccgtcggg tctggaggag 660

ccaccaggtc ctttcgttcc ggaggccgcc cgggcccgga tgcgggagcc agaggccagg 720
gaggactact tcggcacctg tatcaagtgc aacaaaggca tctatgggca gagcaatgcc 780

tgccaggccc tggacagcct ctaccacacc cagtgctttg tttgctgctc ttgtgggcga 840

actttgcgtt gcaaggcttt ctacagtgtc aatggctctg tgtactgtga ggaagattat 900 

ctgttttcag ggtttcagga ggcagctgag aaatgctgtg tctgtggtca cttgattttg 960 

gagaagatcc tacaagcaat ggggaagtcc tatcatccag gctgtttccg atgcattgtt 1020 

tgcaacaagt gcctggatgg catccccttc acagtggact tctccaacca agtatactgt 1080 

gtcaccgact accacaaaaa ttatgctcct aagtgtgcag cctgtggcca acccatcctc 1140

ccctctgagg gctgtgagga catcgtgagg gtgatatcca tggaccggga ttatcacttt 1200 

gagtgctacc actgtgagga ctgccggatg cagctgagtg atgaggaagg ctgctgctgt 1260 

ttccctctgg atgggcactt gctctgccat ggttg 1295

<210> 646 

<211> 3411 

<212> DNA 

<213> Homo sapiens 

<400> 646 

gggcccgggg tcccgccacc accgcgcgcg ggacagattg attcactttg gagctgtaag 60 

tactgatgta ttagggtgca gcgctcattg ttcattgacg cagagtccca aaatgaatat 120 

ccaagagcag ggtttcccct tggacctcgg agcaagtttc accgaagatg ctccccgacc 180

cccagtgcct ggtgaggagg gagaactggt gtccacagac ccgaggcccg ccagctacag 240 

tttctgctcc gggaaaggtg ttggcattaa aggtgagact tcgacggcca ctccgaggcg 300

ctcggatctg gacctggggt atgagcctga gggcagtgcc tcccccaccc caccatactt 360

gaagtgggct gagtcactgc attccctgct ggatgaccaa gatgggataa gcctgttcag 420 

gactttcctg aagcaggagg gctgtgccga cttgctggac ttctggtttg cctgcactgg 480

cttcaggaag ctggagccct gtgactcgaa cgaggagaag aggctgaagc tggcgagagc 540

catctaccga aagtacattc ttgataacaa tggcatcgtg tcccggcaga ccaagccagc 600 

caccaagagc ttcataaagg gctgcatcat gaagcagctg atcgatcctg ccatgtttga 660 

ccaggcccag accgaaatcc aggccactat ggaggaaaac acctatccct ccttccttaa 720 

gtctgatatt tatttggaat atacgaggac aggctcggag agccccaaag tctgtagtga 780

ccagagctct gggtcaggga cagggaaggg catatctgga tacctgccga ccttaaatga 840 

agatgaggaa tggaagtgtg accaggacat ggacgaggac gatggcagag acgctgctcc 900 

ccccggaaga ctccctcaga agctgctcct ggagacagct gccccgaggg tctcctccag 960 

tagacggtac agcgaaggca gagagttcag gtatggatcc tggcgggagc cagtcaaccc 1020

ctattatgtc aatgccggct atgccctggc cccagccacc agtgccaacg acagcgagca 1080 

gcagagcctg tccagcgatg cagacaccct gtccctcacg gacagcagcg tggatgggat 1140 

ccccccatac aggatccgta agcagcaccg cagggagatg caggagagcg cgcaggtcaa 1200 

tgggcgggtg cccctacctc acattccccg cacgtaccgg gtgccgaagg aggtccgcgt 1260 

ggagcctcag aagttcgcgg aggagctcat ccaccgcctg gaggctgtgc agcgcacgcg 1320

ggaggccgag gagaagctgg aggagcggct gaagcgcgtg cgcatggagg aggaaggtga 1380

ggacggcgat ccatcgtcag ggcccccagg gccgtgtcac aagctgcctc ccgcccccgc 1440 

ttggcaccac ttcccgcccc gcttgtgttg gacatgggct tgtgccgggc tccgggatgc 1500 

acacgaggag aaccctgaga gcatcctgga cgagcacgta cagcgtgtgc tgaggacaac 1560 

tggccgccag tcgcctgggc ctggccatcg ctccccggac agtgggcacg tggccaagat 1620 

gccagtggca ctggggggtg ccgcctcggg gcacgggaag cacgtaccca agtcaggggc 1680 

gaagctggac gcggccggcc tgcaccacca ccgacacgtc caccaccacg tccaccacag 1740 

cacagcccgg cccaaggagc aggtggaggc cgaggccacc cgcagggccc agagcagctt 1800 

cgcctggggc ctggaaccac acagccatgg ggcaaggtcc cgaggctact cagagagtgt 1860 

tggcgctgcc cccaacgcca gtgatggcct cgcccacagt gggaaggtgg gcgttgcgtg 1920 

caaaagaaat gccaagaagg ctgagtcggg gaagagcgcc agcaccgagg tgccaggtgc 1980 

ctcggaggat gcggagaaga accagaaaat catgcagtgg atcattgagg gggaaaagga 2040 

gatcagcagg caccgcagga ccggccacgg gtcttcgggg acgaggaagc cacagcccca 2100 

tgagaactcc agaccyttgt cccttgagca cccctgggcc ggccctcagc tccggacctc 2160 

cgtgcagccc tcccacctct tcatccaaga ccccaccatg ccaccccacc cagctcccaa 2220 
ccccctaacc cagctggagg aggcgcgccg acgtctggag gaggaagaaa agagagccag 2280

ccgagcaccc tccaagcaga ggtatgtgca ggaggttatg cggcggggac gcgcctgcgt 2340

caggccagcg tgcgcgccgg tgctgcacgt ggtaccagcc gtgtcggaca tggagctctc 2400 

cgagacagag acaagatcgc agaggaaggt gggcggcggg agtgcccagc cgtgtgacag 2460 

catcgttgtg gcgtactact tctgcgggga acccatcccc taccgcaccc tggtgagggg 2520 

ccgcgctgtc accctgggcc agttcaagga gctgctgacc aaaaagggca gctacagata 2580 

ctacttcaag aaagtgagcg acgagtttga ctgtggggtg gtgtttgagg aggttcgaga 2640 

ggacgaggcc gtcctgcccg tctttgagga gaagatcatc ggcaaagtgg agaaggtgga 2700 

ctgataggct ggtgggctgg ccgctgtgcc aggcgaggcc cttggcgggc acgggtgtca 2760

cggccaggca gatgacctcg tactcaggag cccgatgggg aacagtgttg ggtgtaccac 2820

ccatccctgt ggtctacccg tgtctagagg caggtagggg gtccctccaa gtggtccaca 2880

agcttctgtc ctgcccccaa ggaggcagcc tggaccactc ctcatagcaa tacttggagg 2940 

gcccagccca agtgaggcag ccgaggtccc tgctgccagc ttcaggtgac ccccccccat 3000

cccccggcac ctcccttggg cacgtgtgct gggatctact ttccctctgg gatttgccca 3060

cgtacccagg tctggctggg gcccaggccc ggatgcagag gcctgcaggg cctctgtcaa 3120 

ttgtacgcgc caccaagtgc cttcaacaca gcttgtctct tgcctgccac tgtgtgaatc 3180 

ggcgacggag cactgcacct gcctccagcc gccggctgtg cagtcctggg tcctcctttc 3240 

tgagggcccg tgtaaatatg tacatttctc aggctagggc cagcaggggc tgcccgagtc 3300

tgtttttcat gcgatgacac ttgtacaatt atcttttcaa aggtacttgg ataataatga 3360 

aataaaactg tttttgaacc tgaataaaaa aaaaaaaaaa aaaaaaaaaa a 3411 

<210> 647 

<211> 3140 

<212> DNA 

<213> Homo sapiens 

<400> 647 

ggctgcgagt acctccatgg tcccggtggc tgtgacggcg gcagtggcgc ctgtcctgtc 60 

cataaacagc gatttctcag atttgcggga aattaaaaag caactgctgc ttattgcggg 120 

ccttacccgg gagcggggcc tactacacag tagcaaatgg tcggcggagt tggctttctc 180 

tctccctgca ttgcctctgg ccgagctgca accgcctccg cctattacag aggaagatgc 240 

ccaggatatg gatgcctata ccctggccaa ggcctacttt gacgttaaag agtatgatcg 300 

ggcagcacat ttcctgcatg gctgcaatag caagaaagcc tattttctgt atatgtattc 360

cagatatctg tctggagaaa aaaagaagga cgatgaaaca gttgatagct taggccccct 420 

ggaaaaagga caagtgaaaa atgaggcgct tagagaattg agagtggagc tcagcaaaaa 480 

acaccaagct cgagaacttg atggatttgg actttatctg tatggtgtgg tgcttcgaaa 540

actggacttg gttaaagagg ccattgatgt gtttgtggaa gctactcatg ttttgccctt 600 

gcattgggga gcctggttag aactctgtaa cctgatcaca gacaaagaga tgctgaagtt 660 

cctgtctttg ccagacacct ggatgaaaga gttttttctg gctcatatat acacagagtt 720 

gcagttgata gaggaggccc tgcaaaagta tcagaatctc attgatgtgg gcttctctaa 780

gagctcgtat attgtttccc aaattgcagt tgcctatcac aatatcagag atattgacaa 840

agccctctcc atttttaatg agctaaggaa acaagaccct tacaggattg aaaatatgga 900 

cacattctcc aaccttcttt atgtcaggag catgaaatcg gagttgagtt atctggctca 960 

taacctctgt gagattgata aataccgtgt agaaacgtgc tgtgtaattg gcaattatta 1020 

cagtttacgt tctcagcatg agaaagcagc cttatatttc cagagagccc tgaaattaaa 1080

tcctcggtat cttggtgcct ggacactaat gggacatgag tacatggaga tgaagaacac 1140 

gtctgctgct atccaggctt atagacatgc cattgaggtc aacaaacggg actacagagc 1200

ttggtatggc ctcgggcaga cctatgaaat ccttaagatg ccattttact gcctttatta 1260

ttatagacgg gcccaccagc ttcgacccaa tgattctcgc atgctggttg ctttaggaga 1320 

atgttacgag aaactcaatc aactagtgga agccaaaaag tgttattgga gagcttacgc 1380

cgtgggagat gtggagaaaa tggctctggt gaaactggca aagcttcatg aacagttgac 1440

tgagtcagaa caggctgccc agtgttacat caaatatatc caagatatct attcctgtgg 1500 

ggaaatagta gaacacttgg aggaaagcac tgcctttcgc tatctggccc agtactattt 1560 

taagtgcaaa ctgtgggatg aagcttcaac ttgtgcacaa aagtgttgtg catttaatga 1620



tacccgggaa gaaggtaagg ccttactccg gcaaatccta cagcttcgga accaaggcga 1680

gactcctacc accgaggtgc ctgctccctt tttcctacct gcttcactct ctgctaacaa 1740

tacccccaca cgcagagttt ctccactcaa cttgtcttct gtcacgccat agttggctac 1800 

tctcaagcca gcacattgtt agacccatct taattaagcc ttacctccat gtaaagaaca 1860 

gcacgtctgt tccaaggacc tcagctcttc ttgtttctac agatggcaac agctccatag 1920 

ggacagcttg tataattacc ttcagaggcc aactgacaga atcctggcag gaacagacat 1980 

tatcttgcca gttagaagta cttctgtctc acttatgtcc aaagagtggc tatagatctt 2040

ggccttcttc cctgaatgct tttttttttt ggcccccaag aaagtccctt ttatagcact 2100 

ttagcacagg caatgctaca ggaacaaagt ttcaatgctg ctgagagtga aagaaaggag 2160 

gaaagtctgc cactctaccc tgagctggca gtagggcact gagtaccctt aggaagaagt 2220

cagagcaatg gatacaaatg accttgctct tggatttgct gagcatgatc cctattctga 2280 

tgtcagagat taggtttaaa tggaatagag ctatccattt gttcttactc tctagggaga 2340 

caatcttcca aaacagtttt gggggggtct tctaaagctt tcaaattgga agtaacttta 2400

ttcaactaga gttgaataaa agaagggcaa aaataatctc acagagcttg gaactgctga 2460 

tagcccttac tgagggcaaa agatggctat attgttagct atactcctac caaagcaagc 2520

aaggagatag gattatagat aatttcacgg acatttggaa ataacattgg tgattataca 2580 

gacaagaata aactcacttc aagctggtct gttttaataa attttcaacg taattgtcta 2640 

tttttttccc tcccatctgc aacagaatac atttttttca gcctttatct agatgaggta 2700 

aagggaatca ttcttatggt gctcttggag agtttcaggc ctgtgcatgt gtgtacagca 2760

ggaggtaata tgctataatg tctgctgtaa tatatttgca cagtagatgc tatggatcat 2820

tctgagctca gggtccagac tttattctta ttcccagaat tttgtgttac gtttttacct 2880 

cctaacatat gacacttcat cttatattaa ggaaggttta gaatatctaa tacgacttga 2940 

attcatttgt tactaagcct tctcaggcaa gctgtatact agttactggt ctccactgcc 3000 

atgccttttc aaggttccca tggtccagaa tgatgtttga ttcttaattt ttctgtccct 3060 

tttataattt gttttaatga ttttgctaca tttggaattc aataaaaaat gtgaacaata 3120 

ataaaaaaaa aaaaaaaaaa 3140 

<210> 648 
<211> 3513 
<212> DNA 

<213> Homo sapiens 
<400> 648 

ccgtgtacca ggtgctgcta gtgggaagca cgctgctgaa ggaagtgcct tccgggctgc 60 

agctggagca gttgccttct cagagcctgc tgacccacat cccaacggcg gggctgccca 120

cttcgctagg aggaggcctg ccttactgcc accaggcctg gctggatttc cgaaggcggc 180 

tggaagctct actacagaac tgccaggcag cttgtgccct gctccagggg gccatcgaaa 240 

gtgtgaaggc tgtgccccag cccatggagc ctggggaggt cggtcagctg ctacagcaga 300 

cagaggtcct gatgcagcag gtgctagact cgccatggct ggcatggcta caatgccagg 360 

ggggccggga gctgacatgg ctgaagcaag aggtcccaga ggtgaccctg agcccagact 420

acaggacggc aatggacaag gctgacgagc tatatgaccg ggtggatgga ttgctgcacc 480

aactgaccct gcagagcaac cagcgaatac aggccctaga gttggtccaa acactggagg 540

cccgggaaag cggactgcac cagattgaag tgtggctgca gcaggtgggc tggccagcac 600 

tggaggaggc tggggagccc tcgctggaca tgctgctcca ggcccaaggc tcttttcagg 660 

agctgtacca ggttgcccag gagcaggtca ggcaagggga gaagtttctg cagccgctga 720 

ctggctggga ggcggctgaa ctggaccccc ctggggcacg ctttctggcc ctgcgagccc 780 

agctgactga attctctagg gctttggccc agcggtgcca gcggctggcg gatgctgaga 840

ggctgtttca gctcttcagg gaggccttga cgtgggctga ggaggggcag cgagtgttgg 900

cagagctgga gcaggaacgc ccgggggttg tgttgcagca gctgcagctg cactggacca 960 

ggcaccctga cttgcctcct gcccacttcc gaaagatgtg ggctctggcc acggggctgg 1020

gctcagaggc catccgccag gagtgccgct gggcctgggc gcggtgccag gacacctggc 1080

tggccctgga ccaaaagctt gaggcttcac tgaagctacc accggtgggc agcacagcta 1140 

gcctgtgtgt cagccaggtc cccgctgcac ctgcccaccc tcccctgagg aaggcctaca 1200 

gcttcgatcg gaatctgggg cagagtctca gtgaacctgc ctgccactgc caccatgcgg 1260 
ccactattgc tgcctgccgc agaccagagg ctggaggagg tgccctgccc caggcatccc 1320

ctactgtgcc tccaccaggc agctctgacc ccaggagcct caacaggcta cagctggtgc 1380

tggcagagat ggtggccacg gagcgggagt atgtccgggc tctagagtac actatggaga 1440 

actatttccc cgagctggat cgccccgatg tgccccaggg cctccgcggt cagcgtgccc 1500 

acctctttgg caacctggag aagctgcggg acttccactg ccacttcttc ctgcgtgagc 1560 

tggaggcttg cacccggcac ccaccacgag tggcctatgc cttcctgcgc catagggtgc 1620

agtttgggat gtacgcgctc tacagcaaga ataagcctcg ctccgatgcc ctgatgtcaa 1680 

gctatgggca caccttcttc aaggacaagc agcaagcact gggggaccac ctggacctgg 1740 

cctcctacct gctaaagccc atccagcgca tgggcaagta cgcactgctg ctgcaggagc 1800

tggcacgggc ctgcgggggc cccacgcagg agctcagtgc gctgcgggag gcccagagcc 1860 

ttgtgcactt ccagctgcgg cacggaaacg acctgctggc catggacgcc atccagggct 1920

gtgatgttaa cctcaaggaa caggggcagc tggtgcgaca ggatgagttt gtggtgcgca 1980 

ctgggcgcca caagtccgtg cgccgcatct tcctttttga ggagctgctg ctcttcagca 2040 

agcctcgcca tgggcccaca ggggttgaca catttgccta caagcgctcc ttcaagatgg 2100 

cagaccttgg tctcactgag tgctgtggga acagcaacct gcgcttcgag atctggttcc 2160 

gccgccgcaa ggccagggac acctttgtgc tgcaggcctc cagcctggct atcaagcagg 2220 

cctggacagc tgacatctcc cacctgcttt ggaggcaggc cgtccacaac aaggaggtgc 2280

gcatggctga gatggtgtcc atgggtgtgg ggaacaaggc cttccgagac attgctccca 2340 

gcgaggaagc catcaacgac cgcaccgtca actatgtcct gaagtgccga gaagttcgct 2400

ctcgggcgtc cattgccgta gccccgtttg accatgacag cctctacctg ggggcctcga 2460

actcccttcc tggagaccct gcctcttgct ctgttctggg gtccctcaac ctgcacctgt 2520 

acagagaccc agctcttctg ggtctccgct gtcccctgta tcccagcttc ctagaggaag 2580 

cagcactgga ggctgaggca gagctgggcg gccagccctc tttgactgct gaggactcag 2640 

agatctcgtc ccaatgccca tcagccagtg gctccagtgg ctctgacagc agctgtgtgt 2700 

cagggcaggc cctgggtagg ggcctggagg acttaccctg tgtctgagcc cgggactgga 2760 

cgagcagtag atccagcagc ctgcagctcc aaggaacatt gcctctctgg atctgctgtg 2820

accagggtgt ggctgacacc tgggctacct ccaacctaca tgtgcaacgc tgttgactac 2880

cctttctgat gtgtgtggcc attggactaa ctggcacggg gcctctctag ggaagtctgg 2940 

ttgtagagcc tgaataggct cctggcccca tgaccccttc tcctgtcccc agctcccatc 3000

ccagttgtgg gttaagaata ggctagagca gacattgggt gtttccatgc tgtaggctgg 3060 

tgggggacca tgtgcctcta ggcagtgact agggtgcccc cacccctcag gaagaacaca 3120

ggtgggctcc tagcagctga tccccaatgc ctggccttaa agccgagctc agttaccata 3180 

gggacaggtc cacctctact gggccctcat gcttgccttt cctggccccc aggcccagcc 3240 

cctttttact ggggcagttt cgttattttg acttgatgcc ttttgaataa ctttcaatag 3300

aattgtctaa aattatctta ctggttgtta ggcctttggt gtctcagaga aggagtctag 3360

gtctttgatg tgtgatttaa tcttttattt gtttataata aaaaatagac tgatttgtaa 3420 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3480

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 3513 



<210> 649 

<211> 3597 

<212> DNA 

<213> Homo sapiens 



<400> 649 

catgccagtt acttcctcag gaaaatattt tcttgccttc ttctttcagt atggttttaa 60

atttgggaac agtggataac ccaagtgtcc cacaggccaa ggtatattcc aatggcagca 120

tgatccctgc acccaaagcc agcccctaaa gcctacccct tgtgcacccg cagcctggta 180

agtgagcttg gctgcttgtg aggagctaca agtgaaagag aagttatttt aaataaatcc 240

caaagtttga ggcagactgt ccaggactgt tcccaggaag aagcaggagt tacccacagg 300

aaaagtctct gacctggtcc cctcaggccc agctacctgc gcccaccagc agtgaaggtt 360

gatgtactgg cccagcatct ccacctcccc catgcaacca ggtccctggt accgtgtctc 420

ccgttgcatg tctggcttct gcctgtgctc ctcctgccac gagcatcctc cctgtccctc 480

ctcattccac cgtgtctctc ctgcacacat agcctctgtc ccagggcgat ttatccactt 540

gagtacagga gctgctcaga cctctcagcc cagccctctg tgactgcccc agccccatcc 600

taccccaccc aaagctgcct tcctggctgt aggagctccc tcgtctagcc aaggccctat 660 

gggtccccat ccgaggatcc acaagcaatg acttcccaaa tgacctccac tgcaagaaga 720

atccttacca ctgtttccag agccgtgaac gatgctgtga tggcccaggt ctcagcacca 780

ccctctgtga cctaaaaaga aaagctcaat ttccatctgt cttctttccc aggaccaagg 840 

ggacacagta atgtgaagtc aaatacttaa ccgagcaaag ggccagtgtt gttatcagtc 900 

aaggacaaac ctcccacctc acagacagcc aagcagtgag ggaaagacag acagacatag 960 

gtaggaaggt gctctgcagg cacaaggccc agagaagccc ctctccggga acttcccctg 1020 

ctccttccag gaacagtgag cccagtgagc agtcccagcc agctcttcaa ggccttcaag 1080 

gggtctttcc atgactgagt cacctccagg agctcacctg acccccagag aagacctacc 1140

ccaggcagct ccgtgccctg gcttctcccc atgccccaaa tcccccccag ccatccctcc 1200 

tggtcctcgt ctacatcaag ggcctcttcc cctcttcctc ccagctctca ggacaggtga 1260 

ctgggagacc ttgaaccctc agcctcttcc tttaaaaaaa acaaaacaaa acaaaactgt 1320

gggccattta tttgggattt tggagttgtt tggtttttgt ttttatatct taatagttcg 1380

aaagtaagaa gggagccctg ctatggatgt taagtccaaa ttactcggtt agtgggagca 1440 

aaacctatga cttccaaggg gatgaggaga ggttcagagg acaggaggag cctcccccat 1500

tgaaaaaaaa aaaatgggtc aggacattcc ctggatgagg acaatgctag gggtggcatc 1560

tcacatggct gctgctattc ctggtgcttc cccacacttt tgacagatgg agtccttctc 1620

ctaccgcctc ctgccacctc accctacagg cattctctat gtaggaaaca agagccttat 1680 

cttatagagt ggggagctga gacacagcct caggtaacac tgacacagct cccgaatgag 1740 

gctgggacac tctgcaaacc tctcctcatg gtgctaaggg tggcatgctc ttgacaggaa 1800 

acctaaatga ccactcctct catttggaaa gtaatccact gcagtaaaag tttcagacat 1860 

gcaagagaga gttttttttt ttttttacta caaatttttg ctcccccata aaattatttt 1920

tttattagag ggagtatcca agttttaaaa gtatatagaa ttttttggtt gtaagagaaa 1980 

tacatactca ttaggatccc gattaaattc cttgagtaga ctggtgccta ccagaaagca 2040 

aagcaaagtt aaacaaaacg aaacaaaatc cttcatatac aaaaagaact ttctgtttgt 2100 

attggcagag gtagtgaggt gattcaggta ggctgaaaat cctgggttgc gggagcctca 2160 

ctttattcca ttcccacccg ctttgatgtc tatgcttggc tctctgggct gcccctggta 2220 

ctgccgaatc ctacacatct cttatcagct ttcctcaaac tttaaggagg ctctgtgagg 2280

gatgggtcat gggaagaccc aagctttccc tccgccagga ttgcaaaagc aagtagactt 2340 

ggtctatgca gctcttcttc caacaatttc tttatttgga attagaactt cctttgttag 2400

tatctttgat cttttgactc aagcacattt tggaagggct cccttacaaa agtagaattt 2460 

aaaacagagg atacagttaa agagcaaccc aaaggacgct taagaaaccg agaccacttc 2520 

accgaacagg actaaggaac actttcgtgc acagaagtca gccgcaatcc aggcacagga 2580

cgaagatggg atacacgtgc tcatctgtct gtcctccttt cctctccctc cccgacgttc 2640 

tagttagctt gttgacttgt taaaccttct gttcttaaaa tgaaaagcta gcttacctca 2700 

aagaatcttg tttccattcg gaaaccaacg attttgtgtt ttagaatgga cagccctccc 2760 

ctcaccactc cctaccttgg cctggtgtcc ttgagacata cggtctttgc ttagtcgtgt 2820 

gttggctgct ttgagcagga acgaggcctc caggccctga ggtgggaagg aaggattgga 2880

tgccactgcc ctcctcccca ctttagcatg taggggccag cccatctctt ccagcagggt 2940 

cctgctgagt taccatagca accagcaact ccagggtacc acaacagaca atggctcagc 3000 

gagccgacgt gtggggatga tgcaggggtt ttggcccagc cagaggaccc agagttgagc 3060

ttcaaatgct agagaagggg agaaacagga tggaagggtg gtttaaggaa ctggcagggg 3120 

tctttgagtc acatagagaa gccgttgaag gaggtagggc aggttatctc tgttccagtc 3180 

acccccttcc agccccatcc cacttctgtt tcaaactaaa gctcccacct cgaacattga 3240 

ccctttgtta gaacaaagca aagcatatct ttaaacaaca gtgttaaaat gagcctcaaa 3300 

tgtatgtgga tgagatctct aagaagaggg tcttctggtt ttgattttta aagaagagta 3360

tcctagtaaa atattaaaaa aaaattaaaa agtttttaaa aaggaaacct atgctattta 3420 

aattggagcc cagttgtaac ttggtaaagg caagcttctg tacctttgtt ataattaatt 3480 

gtatacctgt gtatgtaaat ataaggcatt cctattttgc agttcagaac aaaaaaaact 3540 

tatttgtaat atagaataaa gtttattaaa aaataagaaa aaaaaaaaaa aaaaaaa 3597

<210> 650 
<211> 855 
<212> DNA

<213> Homo sapiens 

<400> 650 

cgctcaatta tctactcgag tctagactcg aggcggccgc ccattgtgca ctaaagcagg 60 

ggatagcaac ggcgtccctc ctccccgctc agctgcagcc cgcagtcctc acagtggtaa 120

catgccacgt ggtagtctct gtccatggac accacacgga tggttgtctc gcagccctgt 180 

gcagggagga taggacgggc acaggaggcg cattttggtg caaaaaccgt gtgatagtct 240 

cgcacgcagt agatgttgtt ctccacgtcc acggtgaagg gaaccccgtc caggcactca 300 

ttgcacacgg agcaccggaa gcagcctggg tggtaggact tgcccagggc ctgcaggatc 360 

atttccatga tgagatgtcc acacacgctg catttgtcgg ccgtctgctg gaacccggag 420

tacaggaagt cctcctggca gtacactttc tcacccacgt tgtagaacgc cttcccacgg 480 

agtcgtctcc cacacgagtc gcaggtgaag cagtcagtgt gataaagact ccccattgcc 540

tggcacgcct gctgggctcc gtagatgcca agcccacact tgatgcaaat gccgaagtag 600 

tcccgcgccg tgcgcgcctc gagcgcccgc tccagctccc gggtgagcgc ctccagccgc 660 

cgctcggccg cgcttgggcc gccctcccgg ccagggggca gcgggagtgc aggcagcggg 720 

aagggggccg gccccgcagg ctccggggag cgagcgggcg cggggcaggc gccgggcggg 780

aggaagtcag cgtaggcttt agtgcacaat gggaattcgg atccggtcga cactagttct 840 

aggatctctt ttttt 855 

<210> 651 

<211> 3791 

<212> DNA 

<213> Homo sapiens 

<400> 651 

acagacggcg ggtgaacatg gcgtcctcga cttggtctga gacgtgatag gcctgccttc 60 

tggttgaaga tgtggcgagt gaaaaaactg agcctcagcc tgtcgccttc gccccagacg 120 

ggaaaaccat ctatgagaac tcctctccgt gaacttaccc tgcagcccgg tgccctcacc 180 

acctctggaa aaagatcccc cgcttgctcc tcgctgaccc catcactgtg caagctgggg 240

ctgcaggaag gcagcaacaa ctcgtctcca gtggattttg taaataacaa gaggacagac 300 

ttatcttcag aacatttcag tcattcctca aagtggctag aaacttgtca gcatgaatca 360 

gatgagcagc ctctagatcc aattccccaa attagctcta ctcctaaaac gtctgaggaa 420

gcagtagacc cactgggcaa ttatatggtt aaaaccatcg tccttgtacc atctccactg 480 

gggcagcaac aagacatgat atttgaggcc cgtttagata ccatggcaga gacaaacagc 540

atatctttaa atggaccttt gagaacagac gatctggtga gagaggaggt ggcaccctgc 600 

atgggagaca ggttttcaga agttgctgct gtatctgaga aacctatctt tcaggaatct 660 

ccgtcccatc tcttagagga gtctccacca aatccctgtt ctgaacaact acattgctcc 720

aaggaaagcc tgagcagtag aactgaggct gtgcgtgagg acttagtacc ttctgaaagt 780 

aacgccttct tgccttcctc tgttctctgg ctttcccctt caactgcctt ggcagcagat 840 

ttccgtgtca atcatgtgga cccagaggag gaaattgtag agcatggagc tatggaggaa 900

agagaaatga ggtttcccac acatcctaag gagtctgaaa cagaagatca agcacttgtc 960 

tcaagtgtgg aagatattct gtccacatgc ctgacaccaa atctagtaga aatggaatcc 1020 

caagaagctc caggcccagc agtagaagat gttggtagga ttcttggctc tgatacagag 1080 

tcttggatgt ccccactggc ctggctggaa aaaggtgtaa atacctccgt catgctggaa 1140

aatctccgcc aaagcttatc ccttccctcg atgcttcggg atgctgcaat tggcactacc 1200 

cctttctcta cttgctcggt ggggacttgg tttactcctt cagcaccaca ggaaaagagt 1260 

acaaacacat cccagacagg cctggttggc accaagcaca gtacttctga gacagagcag 1320

ctcctgtgtg gccggcctcc agatctgact gccttgtctc gacatgactt ggaagataac 1380

ctgctgagct ctcttgtcat tgtggagttt ctctcccgcc agcttcggga ctggaagagc 1440 

cagctggctg tccctcaccc agaaacccag gacagtagca cacagactga cacatctcac 1500

agtgggataa ctaataaact tcagcatctt aaggagagcc atgagatggg acaggcccta 1560 

cagcaggcca gaaatgtcat gcaatcatgg gtgcttatct ctaaagagct gatatccttg 1620

cttcacctat ccctgttgca tttagaagaa gataagacta ctgtgaatca ggagtctcgg 1680

cgtgcagaaa cattggtctg ttgctgtttt gatttgctga agaaattgag ggcaaagctc 1740

cagagcctca aagcagaaag ggaggaggca aggcacagag aggaaatggc tctcagaggc 1800

aaggatgcgg cagagatagt gttggaggct ttctgtgcac acgccagcca gcgcatcagc 1860

cagctggaac aggacctagc atccatgcgg gaattcagag gccttctgaa ggatgcccag 1920

acccaactgg tagggcttca tgccaagcaa gaagagctgg ttcagcagac agtgagtctt 1980

acttctacct tgcaacaaga ctggaggtcc atgcaactgg attatacaac atggacagct 2040

ttgctgagtc ggtcccgaca actcacagag aaactcacag tcaagagcca gcaagccctg 2100

caggaacgtg atgtggcaat tgaggaaaag caggaggttt ctagggtgct ggaacaagtc 2160

tctgcccagt tagaggagtg caaaggccaa acagaacaac tggagttgga aaacattcgt 2220

ctagcaacag atctccgggc tcagttgcag attctggcca acatggacag ccagctaaaa 2280

gagctacaga gtcagcatac ccattgtgcc caggacctgg ctatgaagga tgagttactc 2340

tgccagctta cccagagcaa tgaggagcag gctgctcaat gcgtaaagga agagatggca 2400

ctaaaacaca tgcaggcaga actgcagcag caacaagctg tcctggccaa agaggtgcgg 2460

gacctgaaag agaccttgga gtttgcagac caggagaatc aggttgctca cctggagctg 2520

ggtcaggttg agtgtcaatt gaaaaccaca ctggaagtgc tccgggagcg cagcttgcag 2580

tgtgagaacc tcaaggacac tgtagagaac ctaacggcta aactggccag caccatagca 2640

gataaccagg agcaagatct ggagaaaaca cggcagtact ctcaaaagct agggctgctg 2700

actgagcaac tacagagcct gactctcttt ctacagacaa aactaaagga gaagactgaa 2760

caagagaccc ttctgctgag tacagcctgt cctcccaccc aggaacaccc tctgcctaat 2820

gacaggacct tcctgggaag catcttgaca gcagtggcag atgaagagcc agaatcaact 2880

cctgtgccct tgcttggaag tgacaagagt gctttcaccc gagtagcatc aatggtttcc 2940

cttcagcccg cagagacccc aggcatggag gagagcctgg cagaaatgag tattatgact 3000

actgagcttc agagtctttg ttccctgcta caagagtcta aagaagaagc catcaggact 3060

ctgcagcgaa aaatttgtga gctgcaagct aggctgcagg cccaggaaga acagcatcag 3120

gaagtccaga aggcaaaaga agcagacata gagaagctga accaggcctt gtgcttgcgc 3180

tacaagaatg aaaaggagct ccaggaagtg atacagcaga atgagaagat cctagaacag 3240

atagacaaga gtggcgagct cataagcctt agagaggagg tgacccacct tacccgctca 3300

cttcggcgtg cggagacaga gaccaaagtg ctccaggagg cctggcaggc cagctggact 3360

ccaactgcca gcctatggcc accaattgga tccaggagaa agtgtggctc tctcaggagg 3420

tggacaaact gagagtgatg ttcctggaga tgaaaaatga gaaggaaaac tcctgatcaa 3480

gttccagagc ccatagaaat atcctagagg agaaccttcg gcgctctgac aaggagttag 3540

aaaaactaga tgacattgtt cagcatattt ataagaccct gctctctatt ccagaggtgg 3600

tgaggggatg caaagaacta cagggattgc tggaatttct gagctaagaa actgaaagcc 3660

agaatttgtt tcacctcttt ttacctgcaa taccccctta ccccaatacc aagaccaact 3720

ggcatagagc caactgagat aaatgctatt taaataaagt gtatttaatg aaaaaaaaaa 3780

aaaaaaaaaa a 3791



<210> 652 

<211> 1683 

<212> DNA 

<213> Homo sapiens 

<400> 652 

ctctgagtgt ccagtggtca gttgccccag gatggggacc acagccagag cagccttggt 60

cttgacctat ttggctgttg cttctgctgc ctctgaggga ggcttcacgg ctacaggaca 120

gaggcagctg aggccagagc actttcaaga agttggctac gcagctcccc cctccccacc 180

cctatcccga agcctcccca tggatcaccc tgactcctct cagcatggcc ctccctttga 240

gggacagagt caagtgcagc cccctccctc tcaggaggcc acccctctcc aacaggaaaa 300

gctgctacct gcccaactcc ctgctgaaaa ggaagtgggt ccccctctcc ctcaggaagc 360

tgtccccctc caaaaagagc tgccctctct ccagcacccc aatgaacaga aggaaggaac 420

gccagctcca tttggggacc agagccatcc agaacctgag tcctggaatg cagcccagca 480

ctgccaacag gaccggtccc aagggggctg gggccaccgg ctggatggct tcccccctgg 540

gcggccttct ccagacaatc tgaaccaaat ctgccttcct aaccgtcagc atgtggtata 600

tggtccctgg aacctaccac agtccagcta ctcccacctc actcgccagg gtgagaccct 660

caatttcctg gagattggat attcccgctg ctgccactgc cgcagccaca caaaccgcct 720

agagtgtgcc aaacttgtgt gggaggaagc aatgagccga ttctgtgagg ccgagttctc 780 

ggtcaagacc cgaccccact ggtgctgcac gcggcagggg gaggctcggt tctcctgctt 840

ccaggaggaa gctccccagc cacactacca gctccgggcc tgccccagcc atcagcctga 900

tatttcctcg ggtcttgagc tgcctttccc tcctggggtg cccacattgg acaatatcaa 960 

gaacatctgc cacctgaggc gcttccgctc tgtgccacgc aacctgccag ctactgaccc 1020

cctacaaagg gagctgctgg cactgatcca gctggagagg gagttccagc gctgctgccg 1080 

ccaggggaac aatcacacct gtacatggaa ggcctgggag gatacccttg acaaatactg 1140

tgaccgggag tatgctgtga agacccacca ccacttgtgt tgccgccacc ctcccagccc 1200

tactcgggat gagtgctttg cccgtcgggc tccttacccc aactatgacc gggacatctt 1260 

gaccattgac atcagtcgag tcacccccaa cctcatgggc cacctctgtg gaaaccaaag 1320

agttctcacc aagcataaac atattcctgg gctgatccac aacatgactg cccgctgctg 1380

tgacctgcca tttccagaac aggcctgctg tgcagaggag gagaaattaa ccttcatcaa 1440

tgatctgtgt ggtccccgac gtaacatctg gcgagaccct gccctctgct gttacctgag 1500

tcctggggat gaacaggtca actgcttcaa catcaattat ctgaggaacg tggctctagt 1560 

gtctggagac actgagaacg ccaagggcca gggggagcag ggctcaactg gaggaacaaa 1620

tatcagctcc acctctgagc ccaaggaaga atgagtcacc ccagagccct agagggtcag 1680 

atg 1683 

<210> 653 
<211> 1696 
<212> DNA 

<213> Homo sapiens 
<400> 653 

cacctaaaag ccaaaatggg aaaggaaaag actcatatca acattgtcgt cattggacac 60 

gtagattcgg gcaagtccac cactactggc catctgatct ataaatgcgg tggcatcgac 120

aaaagaacca ttgaaaaatt tgagaaggag gctgctgaga tgggaaaggg ctccttcaag 180 

tatgcctggg tcttggataa actgaaagct gagcgtgaac gtggtatcac cattgatatc 240

tccttgtgga aatttgagac cagcaagtac tatgtgacta tcattgatgc cccaggacac 300

agagacttta tcaaaaacat gattacaggg acatctcagg ctgactgtgc tgtcctgatt 360

gttgctgctg gtgttggtga atttgaagct ggtatctcca agaatgggca gacccgagag 420 

catgcccttc tggcttacac actgggtgtg aaacaactaa ttgtcggtgt taacaaaatg 480 

gattccactg agccacccta cagccagaag agatatgagg aaattgttaa ggaagtcagc 540 

acttacatta agaaaattgg ctacaacccc gacacagtag catttgtgcc aatttctggt 600 

tggaatggtg acaacatgct ggagccaagt gctaacatgc cttggttcaa gggatggaaa 660 

gtcacccgta aggatggcaa tgccagtgga accacgctgc ttgaggctgt ggactgcatc 720 

ctaccaccaa ctcgtccaac tgacaagccc ttgcgcctgc ctctccagga tgtctacaaa 780 

attggtggta ttggtactgt tcctgttggc cgagtggaga ctggtgttct caaacccggt 840 

atggtggtca cctttgctcc agtcaacgtt acaacggaag taaaatctgt cgaaatgcac 900

catgaagctt tgagtgaagc tcttcctggg gacaatgtgg gcttcaatgt caagaatgtg 960 

tctgtcaagg atgttcgtcg tggcaacgtt gctggtgaca gcaaaaatga cccaccaatg 1020 

gaagcagctg gcttcactgc tcaggtgatt atcctgaacc atccaggcca aataagcgcc 1080 

ggctatgccc ctgtattgga ttgccacacg gctcacattg catgcaagtt tgctgagctg 1140 

aaggaaaaga ttgatcgccg ttctggtaaa aagctggaag atggccctaa attcttgaag 1200

tctggtgatg ctgccattgt tgatatggtt cctggcaagc ccatgtgtgt tgagagcttc 1260

tcagactatc cacctttggg tcgctttgct gttcgtgata tgagacagac agttgcggtg 1320

ggtgtcatca aagcagtgga caagaaggct gctggagctg gcaaggtcac caagtctgcc 1380 

cagaaagctc agaaggctaa atgaatatta tccctaatac ctgccacccc actcttaatc 1440

agtggtggaa gaacggtctc agaactgttt gtttcaattg gccatttaag tttagtagta 1500

aaagactggt taatgataac aatgcatcgt aaaaccttca gaaggaaagg agaatgtttt 1560 

gtggaccact ttggttttct tttttgcgtg tggcagtttt aagttattag tttttaaaat 1620 

cagtactttt taatggaaac aacttgacca aaaatttgtc acagaatttt gagacccatt 1680 

aaaaaagtta aatgag 1696

<210> 654

<211> 7680 

<212> DNA 

<213> Homo sapiens 

<400> 654 

gaagagcaag aggcaggctc agcaaatggt tcagccccag tccccggtgg ctgtcagtca 60 

aagcaagccc ggttgttatg acaatggaaa acactatcag ataaatcaac agtgggagcg 120 

gacctaccta ggtaatgtgt tggtttgtac ttgttatgga ggaagccgag gttttaactg 180 

cgaaagtaaa cctgaagctg aagagacttg ctttgacaag tacactggga acacttaccg 240 

agtgggtgac acttatgagc gtcctaaaga ctccatgatc tgggactgta cctgcatcgg 300

ggctgggcga gggagaataa gctgtaccat cgcaaaccgc tgccatgaag ggggtcagtc 360 

ctacaagatt ggtgacacct ggaggagacc acatgagact ggtggttaca tgttagagtg 420 

tgtgtgtctt ggtaatggaa aaggagaatg gacctgcaag cccatagctg agaagtgttt 480

tgatcatgct gctgggactt cctatgtggt cggagaaacg tgggagaagc cctaccaagg 540 

ctggatgatg gtagattgta cttgcctggg agaaggcagc ggacgcatca cttgcacttc 600 

tagaaataga tgcaacgatc aggacacaag gacatcctat agaattggag acacctggag 660 

caagaaggat aatcgaggaa acctgctcca gtgcatctgc acaggcaacg gccgaggaga 720

gtggaagtgt gagaggcaca cctctgtgca gaccacatcg agcggatctg gccccttcac 780 

cgatgttcgt gcagctgttt accaaccgca gcctcacccc cagcctcctc cctatggcca 840 

ctgtgtcaca gacagtggtg tggtctactc tgtggggatg cagtggttga agacacaagg 900

aaataagcaa atgctttgca cgtgcctggg caacggagtc agctgccaag agacagctgt 960 

aacccagact tacggtggca acttaaatgg agagccatgt gtcttaccat tcacctacaa 1020

tggcaggacg ttctactcct gcaccacgga agggcgacag gacggacatc tttggtgcag 1080

cacaacttcg aattatgagc aggaccagaa atactctttc tgcacagacc acactgtttt 1140

ggttcagact caaggaggaa attccaatgg tgccttgtgc cacttcccct tcctatacaa 1200

caaccacaat tacactgatt gcacttctga gggcagaaga gacaacatga agtggtgtgg 1260 

gaccacacag aactatgatg ccgaccagaa gtttgggttc tgccccatgg ctgcccacga 1320

ggaaatctgc acaaccaatg aaggggtcat gtaccgcatt ggagatcagt gggataagca 1380

gcatgacatg ggtcacatga tgaggtgcac gtgtgttggg aatggtcgtg gggaatggac 1440 

atgcattgcc tactcgcaac ttcgagatca gtgcattgtt gatgacatca cttacaatgt 1500 

gaacgacaca ttccacaagc gtcatgaaga ggggcacatg ctgaactgta catgcttcgg 1560 

tcagggtcgg ggcaggtgga agtgtgatcc cgtcgaccaa tgccaggatt cagagactgg 1620

gacgttttat caaattggag attcatggga gaagtatgtg catggtgtca gataccagtg 1680

ctactgctat ggccgtggca ttggggagtg gcattgccaa cctttacaga cctatccaag 1740 

ctcaagtggt cctgtcgaag tatttatcac tgagactccg agtcagccca actcccaccc 1800 

catccagtgg aatgcaccac agccatctca catttccaag tacattctca ggtggagacc 1860 

taaaaattct gtaggccgtt ggaaggaagc taccatacca ggccacttaa actcctacac 1920

catcaaaggc ctgaagcctg gtgtggtata cgagggccag ctcatcagca tccagcagta 1980 

cggccaccaa gaagtgactc gctttgactt caccaccacc agcaccagca cacctgtgac 2040 

cagcaacacc gtgacaggag agacgactcc cttttctcct cttgtggcca cttctgaatc 2100 

tgtgaccgaa atcacagcca gtagctttgt ggtctcctgg gtctcagctt ccgacaccgt 2160 

gtcgggattc cgggtggaat atgagctgag tgaggaggga gatgagccac agtacctgga 2220 

tcttccaagc acagccactt ctgtgaacat ccctgacctg cttcctggcc gaaaatacat 2280 

tgtaaatgtc tatcagatat ctgaggatgg ggagcagagt ttgatcctgt ctacttcaca 2340 

aacaacagcg cctgatgccc ctcctgaccc gactgtggac caagttgatg acacctcaat 2400 

tgttgttcgc tggagcagac cccaggctcc catcacaggg tacagaatag tctattcgcc 2460 

atcagtagaa ggtagcagca cagaactcaa ccttcctgaa actgcaaact ccgtcaccct 2520 

cagtgacttg caacctggtg ttcagtataa catcactatc tatgctgtgg aagaaaatca 2580 

agaaagtaca cctgttgtca ttcaacaaga aaccactggc accccacgct cagatacagt 2640 

gccctctccc agggacctgc agtttgtgga agtgacagac gtgaaggtca ccatcatgtg 2700 

gacaccgcct gagagtgcag tgaccggcta ccgtgtggat gtgatccccg tcaacctgcc 2760

tggcgagcac gggcagaggc tgcccatcag caggaacacc tttgcagaag tcaccgggct 2820 

gtcccctggg gtcacctatt acttcaaagt ctttgcagtg agccatggga gggagagcaa 2880

gcctctgact gctcaacaga caaccaaact ggatgctccc actaacctcc agtttgtcaa 2940

tgaaactgat tctactgtcc tggtgagatg gactccacct cgggcccaga taacaggata 3000 

ccgactgacc gtgggcctta cccgaagagg ccagcccagg cagtacaatg tgggtccctc 3060 

tgtctccaag taccccctga ggaatctgca gcctgcatct gagtacaccg tatccctcgt 3120 

ggccataaag ggcaaccaag agagccccaa agccactgga gtctttacca cactgcagcc 3180 

tgggagctct attccacctt acaacaccga ggtgactgag accaccatcg tgatcacatg 3240 

gacgcctgct ccaagaattg gttttaagct gggtgtacga ccaagccagg gaggagaggc 3300

accacgagaa gtgacttcag actcaggaag catcgttgtg tccggcttga ctccaggagt 3360 

agaatacgtc tacaccatcc aagtcctgag agatggacag gaaagagatg cgccaattgt 3420

aaacaaagtg gtgacaccat tgtctccacc aacaaacttg catctggagg caaaccctga 3480 

cactggagtg ctcacagtct cctgggagag gagcaccacc ccagacatta ctggttatag 3540 

aattaccaca acccctacaa acggccagca gggaaattct ttggaagaag tggtccatgc 3600

tgatcagagc tcctgcactt ttgataacct gagtcccggc ctggagtaca atgtcagtgt 3660 

ttacactgtc aaggatgaca aggaaagtgt ccctatctct gataccatca tcccagctgt 3720

tcctcctccc actgacctgc gattcaccaa cattggtcca gacaccatgc gtgtcacctg 3780 

ggctccaccc ccatccattg atttaaccaa cttcctggtg cgttactcac ctgtgaaaaa 3840 

tgaggaagat gttgcagagt tgtcaatttc tccttcagac aatgcagtgg tcttaacaaa 3900 

tctcctgcct ggtacagaat atgtagtgag tgtctccagt gtctacgaac aacatgagag 3960 

cacacctctt agaggaagac agaaaacagg tcttgattcc ccaactggca ttgacttttc 4020

tgatattact gccaactctt ttactgtgca ctggattgct cctcgagcca ccatcactgg 4080

ctacaggatc cgccatcatc ccgagcactt cagtgggaga cctcgagaag atcgggtgcc 4140 

ccactctcgg aattccatca ccctcaccaa cctcactcca ggcacagagt atgtggtcag 4200

catcgttgct cttaatggca gagaggaaag tcccttattg attggccaac aatcaacagt 4260

ttctgatgtt ccgagggacc tggaagttgt tgctgcgacc cccaccagcc tactgatcag 4320 

ctgggatgct cctgctgtca cagtgagata ttacaggatc acttacggag aaacaggagg 4380 

aaatagccct gtccaggagt tcactgtgcc tgggagcaag tctacagcta ccatcagcgg 4440

ccttaaacct ggagttgatt ataccatcac tgtgtatgct gtcactggcc gtggagacag 4500 

ccccgcaagc agcaagccaa tttccattaa ttaccgaaca gaaattgaca aaccatccca 4560 

gatgcaagtg accgatgttc aggacaacag cattagtgtc aagtggctgc cttcaagttc 4620

ccctgttact ggttacagag taaccaccac tcccaaaaat ggaccaggac caacaaaaac 4680 

taaaactgca ggtccagatc aaacagaaat gactattgaa ggcttgcagc ccacagtgga 4740

gtatgtggtt agtgtctatg ctcagaatcc aagcggagag agtcagcctc tggttcagac 4800 

tgcagtaacc aacattgatc gccctaaagg actggcattc actgatgtgg atgtcgattc 4860

catcaaaatt gcttgggaaa gcccacaggg gcaagtttcc aggtacaggg tgacctactc 4920

gagccctgag gatggaatcc atgagctatt ccctgcacct gatggtgaag aagacactgc 4980

agagctgcaa ggcctcagac cgggttctga gtacacagtc agtgtggttg ccttgcacga 5040 

tgatatggag agccagcccc tgattggaac ccagtccaca gctattcctg caccaactga 5100 

cctgaagttc actcaggtca cacccacaag cctgagcgcc cagtggacac cacccaatgt 5160 

tcagctcact ggatatcgag tgcgggtgac ccccaaggag aagaccggac caatgaaaga 5220 

aatcaacctt gctcctgaca gctcatccgt ggttgtatca ggacttatgg tggccaccaa 5280 

atatgaagtg agtgtctatg ctcttaagga cactttgaca agcagaccag ctcagggtgt 5340 

tgtcaccact ctggagaatg tcagcccacc aagaagggct cgtgtgacag atgctactga 5400 

gaccaccatc accattagct ggagaaccaa gactgagacg atcactggct tccaagttga 5460 

tgccgttcca gccaatggcc agactccaat ccagagaacc atcaagccag atgtcagaag 5520

ctacaccatc acaggtttac aaccaggcac tgactacaag atctacctgt acaccttgaa 5580 

tgacaatgct cggagctccc ctgtggtcat cgacgcctcc actgccattg atgcaccatc 5640 

caacctgcgt ttcctggcca ccacacccaa ttccttgctg gtatcatggc agccgccacg 5700 

tgccaggatt accggctaca tcatcaagta tgagaagcct gggtctcctc ccagagaagt 5760 

ggtccctcgg ccccgccctg gtgtcacaga ggctactatt actggcctgg aaccgggaac 5820

cgaatataca atttatgtca ttgccctgaa gaataatcag aagagcgagc ccctgattgg 5880 

aaggaaaaag acagacgagc ttccccaact ggtaaccctt ccacacccca atcttcatgg 5940 

accagagatc ttggatgttc cttccacagt tcaaaagacc cctttcgtca cccaccctgg 6000 

gtatgacact ggaaatggta ttcagcttcc tggcacttct ggtcagcaac ccagtgttgg 6060 

gcaacaaatg atctttgagg aacatggttt taggcggacc acaccgccca caacggccac 6120 






ccccataagg cataggccaa gaccataccc gccgaatgta ggacaagaag ctctctctca 6180 

gacaaccatc tcatgggccc cattccagga cacttctgag tacatcattt catgtcatcc 6240 

tgttggcact gatgaagaac ccttacagtt cagggttcct ggaacttcta ccagtgccac 6300 

tctgacaggc ctcaccagag gtgccaccta caacatcata gtggaggcac tgaaagacca 6360 

gcagaggcat aaggttcggg aagaggttgt taccgtgggc aactctgtca acgaaggctt 6420 

gaaccaacct acggatgact cgtgctttga cccctacaca gtttcccatt atgccgttgg 6480 

agatgagtgg gaacgaatgt ctgaatcagg ctttaaactg ttgtgccagt gcttaggctt 6540 

tggaagtggt catttcagat gtgattcatc tagatggtgc catgacaatg gtgtgaacta 6600 

caagattgga gagaagtggg accgtcaggg agaaaatggc cagatgatga gctgcacatg 6660 

tcttgggaac ggaaaaggag aattcaagtg tgaccctcat gaggcaacgt gttacgatga 6720

tgggaagaca taccacgtag gagaacagtg gcagaaggaa tatctcggtg ccatttgctc 6780 

ctgcacatgc tttggaggcc agcggggctg gcgctgtgac aactgccgca gacctggggg 6840 

tgaacccagt cccgaaggca ctactggcca gtcctacaac cagtattctc agagatacca 6900 

tcagagaaca aacactaatg ttaattgccc aattgagtgc ttcatgcctt tagatgtaca 6960 

ggctgacaga gaagattccc gagagtaaat catctttcca atccagagga acaagcatgt 7020

ctctctgcca agatccatct aaactggagt gatgttagca gacccagctt agagttcttc 7080 

tttctttctt aagccctttg ctctggagga agttctccag cttcagctca actcacagct 7140 

tctccaagca tcaccctggg agtttcctga gggttttctc ataaatgagg gctgcacatt 7200 

gcctgttctg cttcgaagta ttcaataccg ctcagtattt taaatgaagt gattctaaga 7260 

tttggtttgg gatcaatagg aaagcatatg cagccaacca agatgcaaat gttttgaaat 7320 

gatatgacca aaattttaag taggaaagtc acccaaacac ttctgctttc acttaagtgt 7380 

ctggcccgca atactgtagg aacaagcatg atcttgttac tgtgatattt taaatatcca 7440 

cagtactcac tttttccaaa tgatcctagt aattgcctag aaatatcttt ctcttacctg 7500 

ttatttatca atttttccca gtatttttat acggaaaaaa ttgtattgaa aacacttagt 7560 

atgcagttga taagaggaat ttggtataat tatggtgggt gattattttt tatactgtat 7620 

gtgccaaagc tttactactg tggaaagaca actgttttaa taaaagattt acattccaca 7680 



<210> 655 

<211> 1316 

<212> DNA 

<213> Homo sapiens 

<400> 655 

tcctaatacg actcactata gggctcgagc ggccgcccgg gcaggtcgaa tgcaggcgac 60 

ttgcgagctg ggagcgattt aaaacgcttt ggattccccc ggcctgggtg gggagagcga 120

gctgggtgcc ccctagattc cccgcccccg cacctcatga gccgaccctc ggctccatgg 180 

agcccggcaa ttatgccacc ttggatggag ccaaggatat cgaaggcttg ctgggagcgg 240

gaggggggcg gaatctggtc gcccactccc ctctgaccag ccacccagcg gcgcctacgc 300

tgatgcctgc tgtcaactat gcccccttgg atctgccagg ctcggcggag ccgccaaagc 360

aatgccaccc atgccctggg gtgccccagg ggacgtcccc agctcccgtg ccttatggtt 420 

actttggagg cgggtactac tcctgccgag tgtcccggag ctcgctgaaa ccctgtgccc 480 

aggcagccac cctggccgcg taccccgcgg agactcccac ggccggggaa gagtacccca 540

gtcgccccac tgagtttgcc ttctatccgg gatatccggg aacctaccac gctatggcca 600 

gttacctgga cgtgtctgtg gtgcagactc tgggtgctcc tggagaaccg cgacatgact 660 

ccctgttgcc tgtggacagt taccagtctt gggctctcgc tggtggctgg aacagccaga 720 

tgtgttgcca gggagaacag aacccaccag gtcccttttg gaaggcagca tttgcagact 780

ccagcgggca gcaccctcct gacgcctgcg cctttcgtcg cggccgcaag aaacgcattc 840 

cgtacagcaa ggggcagttg cgggagctgg agcgggagta tgcggctaac aagttcatca 900 

ccaaggacaa gaggcgcaag atctcggcag ccaccagcct ctcggagcgc cagattacca 960 

tctggtttca gaaccgccgg gtcaaagaga agaaggttct cgccaaggtg aagaacagcg 1020 

ctacccctta agagatctcc ttgcctgggt gggaggagcg aaagtggggg tgtcctgggg 1080 

agaccagaaa cctgccaagc ccaggctggg gccaaggact ctgctgagag gcccctagag 1140 

acaacaccct tcccaggcca ctggctgctg gactgttcct caggagcggc ctgggtaccc 1200

agtatgtgca gggagacgga accccatgtg acaggcccac tccaccaggg ttcccaaaga 1260

acctggccca gtcataatca ttcatcctca cagtggcaat aatcacgata accagt 1316

<210> 656
<211> 566
<212> DNA
<213> Homo sapiens



<400> 656 

cccaccaaac ccataaagag ggtgggtcga cccacgcgtc cgcggacgcg tgggaaatta 60

ttgaattgga aacagaaata gaaaagttta aagctgagaa cgcatcttta gctaaacttc 120

gcattgaacg agaaagtgcc ttggaaaaac tcaggaaaga aattgcagac ttcgaacaac 180

agaaagcaaa agaattagct cgaatagaag agtttaaaaa ggaggagatg aggaagctac 240

aaaaggaacg taaagttttt gaaaagtata ctacagctgc aagaactttt ccagataaaa 300

aggaacgtga agaaatacag actttaaaac agcaaatagc agatttacgg gaagatttga 360

aaagaaagga gaccaaatgg tcaagtacac acagccgtct cagaagccag atacaaatgt 420

tagtcagaga gaacacagac ctccgggaag aaataaaagt gatggaaaga ttccgactgg 480

atgcctggaa gagagcagaa gccatagaga gcagcctcga ggtggagaag aaggacaagc 540

ttgcgaacac atctgttcga tttcaa 566



<210> 657 

<211> 5067 

<212> DNA 

<213> Homo sapiens 



<400> 657 

gcccggacac ctgtctgcag catggataag tatgacgacc tgggcctgga ggccagtaaa 60

ttcatcgagg acctgaacat gtatgaggcc tctaaggatg ggctcttccg agtggacaag 120

ggtgcaggca acaaccccga gtttgaggaa actcgcaggg tgttcgccac caagatggcc 180

aaaatccacc tccagcagca gcagcagcag ctcctgcagg aggagactct gcccaggggg 240

agtagaggcc ctgtcaatgg agggggccgc ctgggcccac aggcccgttg ggaagttgtg 300

ggcagcaagc tgactgtgga tggtgctgcc aagcctcctc ttgctgcctc gacaggggca 360

cctggggcag tcaccaccct cgctgctggg cagcccccgt acccaccgca ggagcagaga 420

tccaggccat acctgcatgg cacgaggcat ggcagccagg actgtggttc cagggagagc 480

ctggcgactt ctgagatgtc tgctttccac cagccaggcc cctgtgagga tccttcctgc 540

ctcactcatg gagactatta tgacaacctc tccttggcaa gcccaaagtg gggtgacaaa 600

ccaggagtgt cccccagcat cggcctgagt gtagggagtg ggtggcctag ctccccgggg 660

agtgacccac cactgcccaa accctgcggg gaccatcccc taaatcaccg acagctctcc 720

ctgagctcca gcaggtcttc tgagggtagc ctcggtggtc agaatagtgg cattggtggc 780

cgcagcagcg agaagccaac aggcctttgg tccactgcct cctcccagcg ggtgagccct 840

ggcctgcctt ccccaaactt ggagaacgga gcaccagctg tggggcctgt tcagcccagg 900

actccttctg tgtcagcacc cttggccctg agctgcccca ggcaaggagg tcttccaaga 960

tcaaactcgg ggctgggggg tgaggtttca ggtgtgatgt ccaaacccaa tgtggacccc 1020

caaccctggt tccaggatgg gcccaaatct tacctttcca gttctgcccc gtcatcctcg 1080

ccagctggcc tggacggttc acagcagggt gcggtccctg ggctggggcc gaagcctggc 1140

tgcacagacc ttggcactgg tcccaagctc agccccacca gtcttgtcca tccagtgatg 1200

tccaccctgc ctgagttatc ttgtaaagag ggtcccctgg gctggtcttc tgatggtagc 1260

ctgggatctg tgctcctgga cagccccagc tcccctaggg taaggctgcc ctgccagccc 1320

ctcgtcccag gtcctgagct gagaccctct gctgctgagt tgaaattaga agccctcacc 1380

caacgtctgg agcgagagat ggatgctcac ccgaaggctg attactttgg agcctgtgtg 1440

aaatgcagca aaggggtgtt tggggctggc caggcctgtc aggccatggg gaacctctac 1500

catgacacat gcttcacctg tgcagcttgc agccggaagc tgagaggaaa agccttttat 1560

tttgtcaacg gcaaagtgtt ttgtgaagaa gacttcctgt actctggttt ccagcagtcg 1620

gctgacaggt gttttctttg tggacatctg atcatggaca tgatcctgca agccctgggg 1680

aagtcctacc accccggctg tttccgctgt gtcatctgta atgagtgttt ggatggggtg 1740

cccttcaccg tggactcaga gaacaagatc tactgtgtcc gagattacca caaggtgctg 1800 

gcccccaagt gtgcagcctg tgggcttccc atccttccac ctgagggctc agatgagacc 1860 

atccgtgtcg tgtccatgga cagagactac cacgtggagt gttaccactg cgaggactgt 1920

ggtctggagc tcaatgatga agatggccac cgctgttatc cgctggagga ccacctgttc 1980 

tgtcactcct gccacgtgaa gaggctggag aagagaccct catctacagc ccttcaccag 2040

caccacttct agccagagcc acttgcagac atcacggcag gggatgagga gccggggttg 2100 

ctgctgctgc ttccggtggc ccctggggtg gaagtggggt aggggaagag gaggggcagg 2160 

agggagagtt cctgtgagca tgtggggggt gcctttcctt taaccaggga ggtgaacact 2220 

acctgcctcc tgcgtgtatt ttccaagtgc ttttctctgt tgccacattt tcctcaggtt 2280

actcaggaaa atgctccagc atgtgcgagc acatgacctg aggttgcatc atagcaccaa 2340 

aggaatcctc ctgtcccctc tgggaacatt tcatgcttca gagggagagg tttttattga 2400 

gcttgtttca caatatcccc ttgaagggac agctcagctg ccaatacatt caaccctttc 2460 

tcttccttca ggaaaatacc tatacccaaa tgttccctcc cccgacatat atcatggcat 2520 

gacttaaggc ttcttttcac ctgagagctt cagttcttct gcagaatggc tgcaaattta 2580

attgcattaa ggcaagaagg aagctctaat gtgtgctttg tatcctaaga taaatttgct 2640 

tagaaaacca gagtcaagat ttgaaatagg tgaggcaggg tttcctcctt agacactgac 2700 

agcattctcc gtaccccttc aaatccttac tctcctaaag gcagctgagt ccgcgacaga 2760 

aatttgccct atgggagtaa aacatacttt gggagaagaa cttggtgcag gcaccaggat 2820

tttttttttt gcccacgtgt ttgcgctgtt tttctctgga gttctcaaga gttggtgact 2880

tggaaggccg cttctgcaag gcaagtctca ggaacccatg caggtacatc gcttgcacct 2940

gtttttagct tatttaatga cgggcttttg ggaagagctg cccgcatact gagagacagc 3000

ttcttataaa caaggagagt ttttgtgtgt gcgagatctc taagccagcg tgggagggag 3060

cgcctcagga taagttatta tattcatttc gttggtttct ctcctgccca attcttggca 3120 

caggcattat gtttgaagaa accaggataa ggtacactgc ttttgtctgt ttaatttttt 3180 

tagttgtttc ccttcacttt cagtcttcca cacacaaaaa atacctcaca gagcttcacc 3240 

aaatcacaga ttcaggagga atttggcttt cacactggac tcagatacct tcttcagtgt 3300

gttggaaatc actggcttca cacaggccca actccaactg gtcagggcag agtgatcgta 3360

actaaaggtc agtggggaat agatccgatt cagtgctttt gccttatgca tttcagcatc 3420

ctggctcccc agggtggcag gagctgagga agggccacac actggcaaga tttcaagacc 3480

actctctgca ctgaagaggt aaaatttgca ctgcaagtca catccctgag gccagaggtc 3540

agtacccttt ggtatttcga ttagaagaag ctgcaaaaga aaggcagccc attttaccat 3600 

tgccagccag gccggggaca caggagccgg tgtgtgcact ctgcctccta acattgcacc 3660

cagagcaaga ggactgggtg ctgggctgca gaggccggtc agtggagccc ctagcacgtg 3720

tgaactcagg cttttcattg ggcccggctc cacttctagg ccatgttttg actcatttgg 3780 

taaccattgc ctgtaagcag cacagaattg gtgccatgga ttatcttttc catgttgatg 3840 

gaattcattc tgttggaatc ctttggccag atgtcacttc agccagggtg tgcatcatca 3900 

ttggttcttt ttcacaggct gagcctcctg aaaacccatg aacgctgggg ctggggaagt 3960 

gaaccctgag gtggggaccc tctcttccca tcaaatcatc cagctcagtg tggggcgtgg 4020 

caggggggta aatgaagcca gccaatgtgt taacctgtct ctgtcaacct aagaatgttg 4080

gccttactga cacacctttg ctccatgttc aagaccagaa gtagctggga tttgtttgca 4140 

aattgggtaa ttagtttaaa aatctgtgat tacattttta aatgaaattt tcaaagtggc 4200 

ctagattgag gtgattcaga taggtttgcg aatataccat tttatattgt tgagaaagaa 4260

caaaaaggga atttccagat gtcctagaaa tcctagcaac agatttctct ggttgtcagt 4320

ttccctggag aaggcgccag ataggaatct ccaatcagtt gtttttctct tcgcttcagg 4380 

cccttacaca aaagccatga agagatgttc acctacccgg tattttaaat gttctgtaaa 4440 

ttattagcca aatagaactg taatggggtt gtatttatgg gcgcctagaa agaaaacaca 4500 

aggacttggt aggccaggaa gaaaagattt taaaatttag aatgaatagc ccttctgggt 4560 

tttctttttg acaattcttg gacttgaggt aaaacaagga ggattgtggc cggatttcag 4620

atcccaaagc cagcctccat cttaggcctt tgcctcattg tgccttttag gttttcttac 4680 

ccaccgtctc ctgttttgtc ttttttttct tttctcctac ccctatcttg ggacattcag 4740

aaactgcctg ggtggtttga gaagagacaa cccagtttga tctgcaatac aaggatccat 4800 

tcgtaatctc tctctcactg atgttattcc cccatctgcc gtcttggttc atctcaccac 4860

agaagggcat ttagtcctac ccagccatcg gctgcggtat gacagcagga tggcacttcc 4920






catttctctg tggttagtgc tcgagtgaaa acctctttca gctgagtcct ctgaggttct 4980 
gctgttgagt cctgggtggc tgatggaatg attgaggagg tctggtcacc ctcaagcgcc 5040
gtcatcgcct tgtttccatg ggcttct 5067 



<210> 658 
<211> 950 
<212> DNA 
<213> Homo sapiens 



<400> 658 

tcgaccggat ccgaattccc attgtgcact aaagcgtctc cctgctccgc ggcccgggct 60 

ggcgggcggg cgctcggctg gcggctgcag cagcagaggg agacccgcgg caaccccggc 120

aacccagggc tcggcgtcgc tgccaccatg acgggaagca atatgtcgga cgccttggcc 180

aacgccgtgt gccagcgctg ccaggcccgc ttctcccccg ccgagcgcat tgtcaacagc 240 

aatggggagc tgtaccatga gcactgcttc gtgtgtgccc agtgcttccg gcccttcccc 300 

gaggggctct tctatgagtt tgaaggccgg aagtactgcg aacacgactt ccaaatgctg 360 

tttgctccgt gctgtggatc ctgcggtgag ttcatcattg gccgcgtcat caaggccatg 420 

aacaacaact ggcacccggg ctgcttccgc tgcgagctgt gtgatgtgga gctggctgac 480 

ctgggctttg tgaagaatgc cggcaggcat ctctgccggc cttgccacaa ccgtgagaag 540 

gccaagggcc tgggcaagta catctgccag cggtgccacc tggtcatcga cgagcagccc 600 
ctcatgttca ggagcgacgc ctaccaccct gaccacttca actgcaccca ctgtgggaag 660

gagctgacag ccgaggcccg cgagctgaag ggtgagctct actgcctgcc ctgccatgac 720 

aagatgggcg ttcccatctg cggggcctgc cgccggccca tcgagggccg agtggtcaac 780 

gcgctgggca agcagtggca cgtggagcac tttgtctgtg ccaagtgtga gaagccattc 840 

ctggggcacc ggcactatga gaagaagggc ctggcctact gcgagcttta gtgcacaatg 900 

ggcggccgcc tcgagtctag actcgagtag ataattgagc ggaatttctt 950 

<210> 659 

<211> 2346 

<212> DNA 

<213> Homo sapiens 



<400> 659 

ccgccgtcgc ccccgcctcc ccctgcctca gcggctgccc ccgccagcgg gccgcccgct 60 

cccccgggcc ttgcagcggg ccccggcccg gctggagggg ccccgacccc agctctggtg 120 

gcgggcagca gcgccgcggc ccccttccct cacggggact cggccctgaa cgagcaggag 180 

aaggagttgc agcggcggct gaagcgtctc tacccggccg tggacgaaca agagacgccg 240 

ctgcctcggt cctggagccc gaaggacaag ttcagctaca tcggcctctc tcagaacaac 300

ctgcgggtgc actacaaagg tcatggcaaa accccaaaag atgccgcgtc agttcgagcc 360 

acgcatccaa taccagcagc ctgtgggatt tattattttg aagtaaaaat tgtcagtaag 420 

ggaagagatg gttacatggg aattggtctt tctgctcaag gtgtgaacat gaatagacta 480 

ccaggttggg ataagcattc atatggttac catggggatg atggacattc gttttgttct 540 

tctggaactg gacaacctta tggaccaact ttcactactg gtgatgtcat tggctgttgt 600 

gttaatctta tcaacaatac ctgcttttac accaagaatg gacatagttt aggtattgct 660 

ttcactgacc taccgccaaa tttgtatcct actgtggggc ttcaaacacc aggagaagtg 720 

gtcgatgcca attttgggca acatcctttc gtgtttgata tagaagacta tatgcgggag 780 

tggagaacca aaatccaggc acagatagat cgatttccta tcggagatcg agaaggagaa 840

tggcagacca tgatacaaaa aatggtttca tcttatttag tccaccatgg gtactgtgcc 900 

acagcagagg cctttgccag atctacagac cagaccgttc tagaagaatt agcttccatt 960 

aagaatagac aaagaattca gaaattggta ttagcaggaa gaatgggaga agccattgaa 1020

acaacacaac agttataccc aagtttactt gaaagaaatc ctaatctcct tttcacatta 1080 

aaagtgcgtc agtttataga aatggtgaat ggtacagata gtgaagtacg atgtttggga 1140 

ggccgaagtc caaagtctca agacagttat cctgttagtc ctcgaccttt tagtagtcca 1200

agtatgagcc ccagccatgg aatgaatatc cacaatttag catcaggcaa aggaagcacc 1260

gcacattttt caggttttga aagttgtagt aatggtgtaa tatcaaataa agcacatcaa 1320 

tcatattgcc atagtaataa acaccagtca tccaacttga atgtaccaga actaaacagt 1380 

ataaatatgt caagatcaca gcaagttaat aacttcacca gtaatgatgt agacatggaa 1440

acagatcact actccaatgg agttggagaa acttcatcca atggtttcct aaatggtagc 1500 

tctaaacatg accacgaaat ggaagattgt gacaccgaaa tggaagttga ttcaagtcag 1560 

ttgagacgcc agttgtgtgg aggaagtcag gccgccatag aaagaatgat ccactttgga 1620

cgagagctgc aagcaatgag tgaacagcta aggagagact gtggcaagaa cactgcaaac 1680

aaaaaaatgt tgaaggatgc attcagtcta ctagcatatt cagatccctg gaacagccca 1740

gttggaaatc agcttgaccc gattcagaga gaacctgtgt gctcagctct taacagtgca 1800

atattagaaa cccacaatct gccaaagcaa cctccacttg ccctagcaat gggacaggcc 1860

acacaatgtc taggactgat ggctcgatca ggaattggat cctgcgcatt tgccacagtg 1920

gaagactacc tacattagct atgcatttca agagctcaca cttatattgt ggcatatagt 1980

caacatggaa gtagaccagc tctgctgatt tgaaatttag attttttaaa ttatgtactg 2040

gggacaggtt tttgtcgctt tacattgctt cctagtttac agcatgatgc aaatgatttt 2100

ctaacttagt gttaggagaa attattttcc atctttaacc tcttagttgt ctaagagtta 2160 

aatattactg aatttcagac gttcaaattg atcatcacaa atcctttaaa acaattacct 2220

aaaagaaacc aaaaatcctg ccttctttgt gggggagggg ggagagaggg gaaggaaatg 2280 

gaacaagttg tgtttgtgtt agcatgtggg tgatgtaaac ttcaaattgg gagatgttcc 2340 

gacccc 2346 

<210> 660 

<211> 2512 

<212> DNA 

<213> Homo sapiens 



<400> 660 

caatgcactg acggatatga gtgggatcct gtgagacagc aatgcaaaga tattgatgaa 60

tgtgacattg tcccagacgc ttgtaaaggt ggaatgaagt gtgtcaacca ctatggagga 120

tacctctgcc ttccgaaaac agcccagatt attgtcaata atgaacagcc tcagcaggaa 180

acacaaccag cagaaggaac ctcaggggca accaccgggg ttgtagctgc cagcagcatg 240

gcaaccagtg gagtgttgcc cgggggtggt tttgtggcca gtgctgctgc agtcgcaggc 300

cctgaaatgc agactggccg aaataacttt gtcatccggc ggaacccagc tgaccctcag 360

cgcattccct ccaacccttc ccaccgtatc cagtgtgcag caggctacga gcaaagtgaa 420

cacaacgtgt gccaagacat agacgagtgc actgcaggga cgcacaactg tagagcagac 480

caagtgtgca tcaatttacg gggatccttt gcatgtcagt gccctcctgg atatcagaag 540 

cgaggggagc agtgcgtaga catagatgaa tgtaccatcc ctccatattg ccaccaaaga 600 

tgcgtgaata caccaggctc attttattgc cagtgcagtc ctgggtttca attggcagca 660 

aacaactata cctgcgtaga tataaatgaa tgtgatgcca gcaatcaatg tgctcagcag 720

tgctacaaca ttcttggttc attcatctgt cagtgcaatc aaggatatga gctaagcagt 780

gacaggctca actgtgaaga cattgatgaa tgcagaacct caagctacct gtgtcaatat 840

caatgtgtca atgaacctgg gaaattctca tgtatgtgcc cccagggata ccaagtggtg 900

agaagtagaa catgtcaaga tataaatgag tgtgagacca caaatgaatg ccgggaggat 960

gaaatgtgtt ggaattatca tggcggcttc cgttgttatc cacgaaatcc ttgtcaagat 1020

ccctacattc taacaccaga gaaccgatgt gtttgcccag tctcaaatgc catgtgccga 1080

gaactgcccc agtcaatagt ctacaaatac atgagcatcc gatctgatag gtctgtgcca 1140

tcagacatct tccagataca ggccacaact atttatgcca acaccatcaa tacttttcgg 1200

attaaatctg gaaatgaaaa tggagagttc tacctacgac aaacaagtcc tgtaagtgca 1260

atgcttgtgc tcgtgaagtc attatcagga ccaagagaac atatcgtgga cctggagatg 1320

ctgacagtca gcagtatagg gaccttccgc acaagctctg tgttaagatt gacaataata 1380

gtggggccat tttcatttta gtcttttcta agagtcaacc acaggcattt aagtcagcca 1440

aagaatattg ttaccttaaa gcactatttt atttatagat atatctagtg catctacatc 1500 

tctatactgt acactcaccc ataacaaaca attacaccat ggtataaagt gggcatttaa 1560 

tatgtaaaga ttcaaagttt gtctttatta ctatatgtaa attagacatt aatccactaa 1620
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actggtcttc ttcaagagag ctaagtatac actatctggt gaaacttgga ttctttccta 168 0 

taaaagtggg accaagcaat gatgatcttc tgtggtgctt aaggaaactt actagagctc 1740 

cactaacagt ctcataagga ggcagccatc ataaccattg aatagcatgc aagggtaaga 1800 

atgagttttt aactgctttg taagaaaatg gaaaaggtca ataaagatat atttctttag 1860 

aaaatgggga tctgccatat ttgtgttggt ttttattttc atatccagcc taaaggtggt 1920 

tgtttattat atagtaataa atcattgctg tacaacatgc tggtttctgt agggtatttt 1980 

taattttgtc agaaatttta gattgtgaat attttgtaaa aaacagtaag caaaattttc 2040

cagaattccc aaaatgaacc agataccccc tagaaaatta tactattgag aaatctatgg 2100 

ggaggatatg agaaaataaa ttccttctaa accacattgg aactgacctg aagaagcaaa 2160 

ctcggaaaat ataataacat ccctgaattc aggcattcac aagatgcaga acaaaatgga 2220

taaaaggtat ttcactggag aagttttaat ttctaagtaa aatttaaatc ctaacacttc 2280

actaatttat aactaaaatt tctcatcttc gtacttgatg ctcacagagg aagaaaatga 2340 

tgatggtttt tattcctggc atccagagtg acagtgaact taagcaaatt accctcctac 2400 

ccaattctat ggaatatttt atacgtctcc ttgtttaaaa tctgactgct ttactttgat 2460 

gtatcatatt tttaaataaa aataaatatt cctttagaag atcactctaa aa 2512 

<210> 661 

<211> 2306 

<212> DNA 

<213> Homo sapiens 



<400> 661 

gggcgggagc tgcacgcgcc gtggctccgg atctcttcgt ctttgcagcg tacgcccgag 60 

tcggtcagcg ccggaggacc tcagcagcca tgtcgaagcc ccatagtgaa gccgggactg 120 

ccttcattca gacccagcag ctgcacgcag ccatggctga cacattcctg gagcacatgt 180 

gccgcctgga cattgattca ccacccatca cagcccggaa cactggcatc atctgtacca 240 

ttggcccagc ttcccgatca gtggagacgt tgaaggagat gattaagtct ggaatgaatg 300 

tggctcgtct gaacttctct catggaactc atgagtacca tgcggagacc atcaagaatg 360

tgcgcacagc cacggaaagc tttgcttctg acccctacct ctaccggccc gttgctgtgg 420 

ctctagacac taaaggacct gagatccgaa ctgggctcat caagggcagc ggcactgcag 480 

agctggagct gaagaaggga gccactctca aaatcacgct ggataacgcc tacatggaaa 540 

agtgtgacga gaacatcctg tggctggact acaagaacat ctgcaaggtg gtggaagtgg 600 

gcagcaagat ctacgtggat gatgggctta tttctctcca ggtgaagcag aaaggtgccg 660 

acttcctggt gacggaggtg gaaaatggtg gctccttggg cagcaagaag ggtgtgaacc 720 

ttcctggggc tgctgtggac ttgcctgctg tgtcggagaa ggacatccag gatctgaagt 780 

ttggggtcga gcaggatgtt gatatggtgt ttgcgtcatt catccgcaag gcatctgatg 840 

tccatgaagt taggaaggtc ctgggagaga agggaaagaa catcaagatt atcagcaaaa 900 

tcgagaatca tgagggggtt cggaggtttg atgaaatcct ggaggccagt gatgggatca 960 

tggtggctcg tggtgatcta ggcattgaga ttcctgcaga gaaggtcttc cttgctcaga 1020 

agatgatgat tggacggtgc aaccgagctg ggaagcctgt catctgtgct actcagatgc 1080 

tggagagcat gatcaagaag ccccgcccca ctcgggctga aggcagtgat gtggccaatg 1140 

cagtcctgga tggagccgac tgcatcatgc tgtctggaga aacagccaaa ggggactatc 1200 

ctctggaggc tgtgcgcatg cagcacctga ttgcccgtga ggcagaggct gccatctacc 1260 

acttgcaatt atttgaggaa ctccgccgcc tggcgcccat taccagcgac cccacagaag 1320

ccaccgccgt gggtgccgtg gaggcctcct tcaagtgctg cagtggggcc ataatcgtcc 1380

tcaccaagtc tggcaggtct gctcaccagg tggccagata ccgcccacgt gcccccatca 1440 

ttgctgtgac ccggaatccc cagacagctc gtcaggccca cctgtaccgt ggcatcttcc 1500 

ctgtgctgtg caaggaccca gtccaggagg cctgggctga ggacgtggac ctccgggtga 1560 

actttgccat gaatgttggc aaggcccgag gcttcttcaa gaagggagat gtggtcattg 1620

tgctgaccgg atggcgccct ggctccggct tcaccaacac catgcgtgtt gttcctgtgc 1680 

cgtgatggac cccagagccc ctcctccagc ccctgtccca cccccttccc ccagcccatc 1740 

cattaggcca gcaacgcttg tagaactcac tctgggctgt aacgtggcac tggtaggttg 1800 

ggacaccagg gaagaagatc aacgcctcac tgaaacatgg ctgtgtttgc agcctgctct 1860 

agtgggacag cccagagcct ggctgcccat catgtggccc cacccaatca agggaagaag 1920
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gaggaatgct ggactggagg cccctggagc cagatggcaa gagggtgaca gcttcctttc 

ctgtgtgtac tctgtccagt tcctttagaa aaaatggatg cccagaggac tcccaaccct 2040

ggcttggggt caagaaacag ccagcaagag ttaggggcct tagggcactg ggctgttgtt 2100

ccattgaagc cgactctggc cctggccctt acttgcttct ctagctctct aggcctctcc 2160

agtttgcacc tgtccccacc ctccactcag ctgtcctgca gcaaacactc caccctccac 2220

cttccatttt cccccactac tgcagcacct ccaggcctgt tgctatagag cctacctgta 2280

tgtcaataaa caacagctga agcacc 2306



<210> 662 

<211> 2656 

<212> DNA 

<213> Homo sapiens 

<400> 662 

gaattcgcgg ccgcagagtc cccgggccaa gatggctgcg cggtgctcca cacgctggtt 60 

gctggtggtt gtggggaccc cgcggctgcc ggctatatcg ggtagagggg cccggccgcc 120 

cagggagggc gtggtggggg catggctgag ccgcaagctg agcgtccccg cctttgcgtc 180 

ttccctgacc tcttgcggcc cccgagcgct gctgacattg agacctggtg tcagccttac 240 

aggaacaaaa cataaccctt tcatttgtac tgcctccttc cacacgagtg cccctttggc 300 

caaagaagat tattatcaga tattaggagt gcctcgaaat gccagccaga aagagatcaa 360 

gaaagcctat tatcagcttg ccaagaagta tcaccctgac acaaataagg atgatcccaa 420 

agccaaggag aagttctccc agctggcaga agcctatgag gttttgagtg atgaggtgaa 480 

gaggaagcag tacgatgcct acggctctgc aggcttcgat cctggggcca gcggctccca 540

gcatagctac tggaagggag gccccactgt ggaccccgag gagctgttca ggaagatctt 600 

tggcgagttc tcatcctctt catttggaga tttccagacc gtgtttgatc agcctcagga 660 

atacttcatg gagttgacat tcaatcaagc tgcaaagggg gtcaacaagg agttcaccgt 720 

gaacatcatg gacacgtgtg agcgctgcaa cggcaagggg aacgagcccg gcaccaaggt 780 

gcagcattgc cactactgtg gcggctccgg catggaaacc atcaacacag gcccttttgt 840 

gatgcgttcc acgtgtagga gatgtggtgg ccgcggctcc atcatcatat cgccctgtgt 900 

ggtctgcagg ggagcaggac aagccaagca gaaaaagcga gtgatgatcc ctgtgcctgc 960 

aggagtcgag gatggccaga ccgtgaggat gcctgtggga aaaagggaaa ttttcattac 1020

gttcagggtg cagaaaagcc ctgtgttccg gagggacggc gcagacatcc actccgacct 1080

ctttatttct atagctcagg ctcttcttgg gggaacagcc agagcccagg gcctgtacga 1140 

gacgatcaac gtgacgatcc cccctgggac tcagacagac cagaagattc ggatgggtgg 1200 

gaaaggcatc ccccggatta acagctacgg ctacggagac cactacatcc acatcaagat 1260 

acgagttcca aagaggctaa cgagccggca gcagagcctg atcctgagct acgccgagga 1320 

cgagacagat gtggagggga cggtgaacgg cgtcaccctc accagctctg gtggcagcac 1380

catggatagc tccgcaggaa gcaaggctag gcgtgaggct ggggaggacg aggagggatt 1440 

cctttccaaa cttaagaaaa tgtttacctc atgatatccc agccgaggaa aaagatccac 1500 

tggaaactag gccgggaagc agcagcccct ccaagggcca gggcacctgg gagacgggag 1560 

gattccagaa cagcagcact gagctcccac ccgcagagcc tctggacggc cttggcaaca 1620

gcaaaatcat gggacaacac ctctctccac ggaaaggtca cagtggacag cccgggcagt 1680 

aggatgcagc cccagaggct ggtggcagtt tcctgtccat tggtaggtga cggccccctg 1740 

gtcagcagag gagaggttag atcttgcagg ctaaaactct aatttggaat tgaatattgt 1800 

ggatatctta gttaaaggcc atgcttacag cttagaaatg aagccttaag ctgcatcaag 1860 

ttacgaagtg attaatttcc ttctcagcaa acctccggga ggttccagaa tgagttcttc 1920

ctgacaggtt gtcttcactg ggagcgtggg gcccccaggc cccaccagca ccgtcctccc 1980 

ctaatgaggg gccctgccga ggcatcagct gctctgctca gttagttttt attcccgggg 2040

taccaagcag ctgcacagtc ggtgcctggg aagcacgtta aaggcccaga gagatcctgg 2100 

gggttctgct ctgaccgtgt gggtggtgat ccttgtcagg atgtacagtc ctfcgctccca 2160 

ccccatccgg gatggccgcc tgtccctgac tattgagtcc tgttgttgta agccaggcat 2220

ggagggctcc tgcccttctg ctgagccaca gcccattgca gcactgtgct ggccagactt 2280 

cagctgcctt gggaactgaa gccctgccac tgttgctagt caggggcttg gttctcccac 2340 

ttacactgtt gacatctatt ttctgaagtg tgtttaaatt attcagtgct aatcattgtt 2400 
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ttttcctttg taaatgttga ttcagaaaag gaaagcacag gctaagcagt tgaaggttcc 2460 

ccaccattca gtgagagcag aacccccatt ccccagcctc tgctggtagc atgtcgcagt 2520

ttccatgtgt ttcaggatct tcgggctgtc gttagacagg ttaatgaaga acacttctca 2580 

acagtttcct ttttgttttc ctttataatt cactaaaata aagcatctat tagtgtctga 2640 

aaaaaaaaaa aaaaaa 2656 



<210> 663 

<211> 2217 

<212> DNA 

<213> Homo sapiens 



<400> 663 

gcggacccgg cgccgaggcg gccacccgag acgcggcgcg cacgctccgg cctgcgcagc 60 

ccggcccggc catggcggcc ccccgcccgt ctcccgcgat ctccgtttcg gtctcggctc 120

cggcttttta cgccccgcag aagaagttcg gccctgtggt ggccccaaag cccaaagtga 180 

atcccttccg gcccggggac agcgagcctc ccccggcacc cggggcccag cgcgcacaga 240 

tgggccgggt gggcgagatt cccccgccgc ccccggaaga ctttcccctg cctccacctc 300

cccttgctgg ggatggcgac gatgcagagg gtgctctggg aggtgccttc ccgccgcccc 360 

ctcccccgat cgaggaatca tttccccctg cgcctctgga ggaggagatc ttcccttccc 420

cgccgcctcc tccggaggag gagggagggc ctgaggcccc cataccgccc ccaccacagc 480 

ccagggagaa ggtgagcagt attgatttgg agatcgactc tctgtcctca ctgctggatg 540 

acatgaccaa gaatgatcct ttcaaagccc gggtgtcatc tggatatgtg cccccaccag 600 

tggccactcc attcagttcc aagtccagta ccaagcctgc agccgggggc acagcacccc 660 

tgcctccttg gaagtcccct tccagctccc agcctctgcc ccaggttccg gctccggctc 720

agagccagac acagttccat gttcagcccc agccccagcc caagcctcag gtccaactcc 780 

atgtccagtc ccagacccag cctgtgtctt tggctaacac ccagccccga gggcccccag 840 

cctcatctcc ggctccagcc cctaagtttt ctccagtgac tcctaagttt actcctgtgg 900 

cttccaagtt cagtcctgga gccccaggtg gatctgggtc acaaccaaat caaaaattgg 960 

ggcaccccga agctctttct gctggcacag gctcccctca acctcccagc ttcacctatg 1020 

cccagcagag ggagaagccc cgagtgcagg agaagcagca ccccgtgccc ccaccggctc 1080 

agaaccaaaa ccaggtgcgc tcccctgggg ccccagggcc cctgactctg aaggaggtgg 1140 

aggagctgga gcagctgacc cagcagctaa tgcaggacat ggagcatcct cagaggcaga 1200

atgtggctgt caacgaactc tgcggccgat gccatcaacc cctggcccgg gcgcagccag 1260

ccgtccgcgc tctagggcag ctgttccaca tcgcctgctt cacctgccac cagtgtgcgc 1320

agcagctcca gggccagcag ttctacagtc tggagggggc gccgtactgc gagggctgtt 1380

acactgacac cctggagaag tgtaacacct gcggggagcc catcactgac cgcatgctga 1440

gggccacggg caaggcctat cacccgcact gcttcacctg tgtggtctgc gcccgccccc 1500

tggagggcac ctccttcatc gtggaccagg ccaaccggcc ccactgtgtc cccgactacc 1560

acaagcagta cgccccgagg tgctccgtct gctctgagcc catcatgcct gagcctggcc 1620

gagatgagac tgtgcgagtg gtcgccctgg acaagaactt ccacatgaag tgttacaagt 1680

gtgaggactg cgggaagccc ctgtcgattg aggcagatga caatggctgc ttccccctgg 1740

acggtcacgt gctctgtcgg aagtgccaca ctgctagagc ccagacctga gtgaggacag 1800

gccctcttca gaccgcagtc catgccccat tgtggaccac ccacactgag accacctgcc 1860

cccacctcag ttattgtttt gatgtctagc ccctcccatt tccaacccct ccctagcatc 1920

ccaggtgccc tgacccagga cccaacatgg tctagggatg caggatcccc gccctggggt 1980

ctggtcctcg cccatcctgc agggattgcc caccgtcttc cagacacccc acctgagggg 2040

ggcaccaggt ttagtgctgc tgctttcact gctgcacccg cgccctcggc cggccccccg 2100 

agcagccttt gtactctgct tgcggagggc tgggagaccc tccaggacat tcccaccctc 2160 

ccccatgctg ccaagttgta gctatagcta caaataaaaa aaaaccttgt tttccag 2217 



<210> 664 

<211> 8906 

<212> DNA 

<213> Homo sapiens 
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<400> 664 

gaggcggcca aggacctggc cgacatcgcg gccttcttcc gatccgggtt tcgaaaaaac 60 

gatgaaatga aagctatgga tgttttacca attttgaagg aaaaagttgc atacctttca 120 

ggtgggagag ataaacgtgg aggtcccatt ttaacgtttc cggcccgcag caatcatgac 180 

agaatacgac aggaggatct caggagactc atttcctatc tagcctgtat tcccagcgag 240 

gaggtctgca agcgtggctt cacggtgatc gtggacatgc gtgggtccaa gtgggactcc 300 

atcaagcccc ttctgaagat cctgcaggag tccttcccct gctgcatcca tgtggccctg 360 

atcatcaagc cagacaactt ctggcagaaa cagaggacta attttggcag ttctaaattt 420 

gaatttgaga caaatatggt ctctttagaa ggccttacca aagtagttga tccttctcag 480 

ctaactcctg agtttgatgg ctgcctggaa tacaaccacg aagaatggat tgaaatcaga 540 

gttgcttttg aagactacat tagcaatgcc acccacatgc tgtctcggct ggaggaactt 600 

caggacatcc tagctaagaa ggagctgcct caggatttag agggggctcg gaatatgatc 660 

gaggaacatt ctcagctgaa gaagaaggtg attaaggccc ccatcgagga cctggatttg 720

gagggacaga agctgcttca gaggatacag agcagtgaaa gctttcccaa aaagaactca 780 

ggctcaggca atgcggacct gcagaacctc ttgcccaagg tgtccaccat gctggaccgg 840 

ctgcactcga cacggcagca tctgcaccag atgtggcatg tgaggaagct gaagctggac 900 

cagtgcttcc agctgaggct gtttgaacag gatgctgaga agatgtttga ctggatcaca 960

cacaacaaag gcctgtttct aaacagctac acagagattg ggaccagcca ccctcatgcc 1020

atggagcttc agacgcagca caatcacttt gccatgaact gtatgaacgt gtatgtaaat 1080

ataaaccgca tcatgtcggt ggccaatcgt ctggtggagt ctggccacta tgcctcgcag 1140 

cagatcaggc agatcgcgag tcagctggag caggagtgga aggcgtttgc ggcagccctg 1200 

gatgagcgga gcaccttgct ggacatgtcc tccattttcc accagaaggc cgaaaagtat 1260 

atgagcaacg tggattcatg gtgtaaagct tgcggtgagg tagaccttcc ctcagagctg 1320

caggacctag aagatgccat tcatcaccac cagggaatat atgaacatat cactcttgct 1380 

tattctgagg tcagccaaga tgggaagtcg ctccttgaca agctccagcg gcccttgact 1440

cccggcagct ccgattccct gacagcctct gccaactact ccaaggccgt gcaccatgtc 1500 

ctggatgtca tccacgaggt gctgcaccac cagcggcacg tgagaacaat ctggcaacac 1560 

cgcaaggtcc ggctgcatca gaggctgcag ctgtgtgttt tccagcagga agttcagcag 1620

gtgctagact ggatcgagaa ccacggagaa gcatttctga gcaaacatac aggtgtgggg 1680 

aaatctcttc atcgggccag agcattgcag aaacgtcatg aagattttga agaagtggca 1740 

cagaacacat acaccaatgc ggataaatta ctggaagcag cagaacagct ggctcagact 1800

ggggaatgtg accccgaaga gatttatcag gctgcccatc agctggaaga ccggattcaa 1860 

gatttcgttc ggcgtgttga gcagcgaaag atcctactgg acatgtcagt gtcctttcac 1920

acccatgtga aagagctgtg gacgtggctg gaggagctgc agaaggagct gctggacgac 1980

gtgtatgccg agtcggtgga ggccgtgcag gacctcatca agcgctttgg ccagcagcag 2040

cagaccaccc tgcaggtgac tgtcaacgtg atcaaggaag gggaggacct catccagcag 2100 

ctcagggact ctgccatctc cagtaacaag accccccaca acagctccat caaccacatt 2160 

gagacggtgc tgcagcagct ggacgaggcg cagtcgcaga tggaggagct cttccaggag 2220 

cgcaagatca agctggagct cttcctgcac gtgcgcatct tcgagaggga cgccatcgac 2280

attatctcag acctcgagtc ttggaatgat gagctttctc agcaaatgaa tgacttcgac 2340

acagaagatc tcacgattgc agagcagcgc ctccagcacc atgcagacaa agccttgacc 2400 

atgaacaact tgacttttga cgtcatccac caagggcaag atcttctgca gtatgtcaat 2460 

gaggtccagg cctctggtgt ggagctgctg tgtgatagag atgtagacat ggcaactcgg 2520

gtccaggacc tgctggagtt tcttcatgaa aaacagcagg aattggattt agccgcagag 2580

cagcatcgga aacacctgga gcagtgcgtg cagctgcgcc acctgcaggc agaagtgaaa 2640

caggtgctgg gttggatccg caacggagag tccatgttaa atgccggact tatcacagcc 2700 

agctcgttac aagaggcaga gcagctccag cgagagcacg agcagttcca gcatgccatt 2760 

gagaaaacac atcagagcgc gctgcaggtg cagcagaagg cagaagccat gctacaggcc 2820

aaccactacg acatggacat gatccgggac tgcgccgaga aggtggcgtc tcactggcaa 2880

cagctcatgc tcaagatgga agatcgcctc aagctcgtca acgcctctgt cgctttctac 2940

aaaacctcag agcaggtctg cagcgtcctc gagagcctgg aacaggagta caagagagaa 3000

gaagactggt gtggcggggc ggataagctg ggcccaaact ctgagacgga ccacgtgacg 3060

cccatgatca gcaagcacct ggagcagaag gaggcattcc tgaaggcttg cacccttgct 3120

cggaggaatg cagacgtctt cctgaaatac ctgcacagga acagcgtgaa catgccagga 3180






atggtgacgc acatcaaagc tcctgaacag caagtgaaaa atatcttgaa tgaactcttc 3240

caacgggaga acagggtatt gcattactgg accatgagga agagacggct ggaccagtgt 3300

cagcagtacg tggtctttga gaggagtgcc aagcaggctt tggaatggat ccatgacaat 3360

ggcgagttct acctttccac acacacctcc acgggctcca gtatacagca cacccaggag 3420

ctcctgaaag agcacgagga gttccagata actgcaaagc aaaccaaaga gagagtgaag 3480

ctattgatac agctggctga tggcttttgt gaaaaagggc atgcccatgc ggcagagata 3540

aaaaaatgtg ttactgctgt ggataagagg tacagagatt tctctctgcg gatggagaag 3600

tacaggacct ctttggaaaa agccctgggg atttcttcag attccaacaa atcgagtaaa 3660

agtctccagc tagatatcat tccagccagt atccctggct cagaggtgaa acttcgagat 3720

gctgctcatg aacttaatga agagaagcgg aaatctgccc gcaggaaaga gttcataatg 3780

gctgagctca ttcaaactga aaaggcttat gtaagagacc tccgggaatg tatggatacg 3840

tacctgtggg aaatgaccag tggcgtggaa gagattccac ctggcattgt aaacaaagaa 3900

ctcatcatct tcggaaacat gcaagaaatc tacgaatttc ataataacat attcctaaag 3960

gagctggaaa aatatgaaca gttgccagag gatgttggac attgttttgt tacttgggca 4020

gacaagtttc agatgtatgt cacatattgc aaaaataagc ctgattctac tcagctgata 4080

ttggaacatg cagggtccta ttttgacgag atacagcagc gacatggatt agccaattcc 4140

atttcttcct accttattaa accagttcag cgaataacga aatatcagct ccttttaaaa 4200

gagctgctga cgtgctgtga ggaaggaaag ggagagatta aagatggcct ggaggtgatg 4260

ctcagcgtgc cgaagcgagc caatgacgcc atgcacctca gcatgctgga agggtttgat 4320

gaaaacattg agtctcaggg agaactcatc ctacaggaat ccttccaagt gtgggaccca 4380 

aaaaccttaa ttcgaaaggg tcgagaacgg catctcttcc tttttgaaat gtccttagta 4440 

tttagtaaag aagtgaaaga ttccagtggg agaagcaagt acctttataa aagcaaattg 4500 

tttacctcag agttgggtgt cacagaacat gttgaaggag acccttgcaa atttgcactg 4560 

tgggtgggga gaacaccaac ttcagataat aaaattgtcc ttaaggcttc cagcatagag 4620 

aacaagcagg actggataaa gcatatccgc gaagtcatcc aggagcggac gatccacctg 4680 

aagggagccc tgaaggagcc cattcacatc cctaagaccg ctcccgccac aagacagaag 4740 

ggaaggaggg atggagagga tctggacagc caaggagacg gcagcagcca gcctgatacg 4800

atttccatcg cctcacggac gtctcagaac acgctggaca gcgataagct ctctggtggc 4860 

tgtgagctga cagtggtgat ccatgacttc accgcttgca acagcaacga gctgaccatc 4920 

cgacggggcc agaccgtgga agttctggag cggccgcatg acaagcctga ctggtgtctg 4980 

gtgcggacca ctgaccgctc cccagcggca gaaggcctgg tcccctgtgg ttcactgtgc 5040 

atcgcccact ccagaagtag catggaaatg gagggcatct tcaaccacaa agactcgctc 5100

tccgtctcca gcaatgacgc cagtccaccc gcatccgtgg cttccctcca gccccacatg 5160 

atcggggccc agagctcgcc gggccccaag cggccgggca acaccctgcg caagtggctc 5220 

accagccccg tgcggcggct cagcagcggc aaggccgacg ggcacgtgaa gaagctggcg 5280

cacaagcaca agaagagccg cgaggtccgc aagagcgccg acgccggctc gcagaaggac 5340 

tccgacgaca gtgcggccac cccgcaggac gagacggtcg aggagagagg ccggaacgag 5400 

ggcctgagca gcggtactct ctccaaatcc tcctcctcgg ggatgcagag ctgtggagaa 5460

gaggaaggcg aggagggggc cgacgccgtg cccctgccgc cacccatggc catccagcag 5520 

cacagcctcc tccagccaga ctcacaggat gacaaggcct cttctcggtt attagtccgc 5580

cccaccagct ccgaaacacc gagtgcagcc gagctcgtca gtgcaattga ggaactcgtg 5640

aaaagcaaga tggcactgga ggatcgcccc agctcactcc ttgttgacca gggagatagt 5700

agcagccctt ccttcaaccc ttcggataat tcccttctct cttcctcctc gcccattgat 5760 

gagatggaag aaaggaaatc cagctcttta aagagaagac actacgtttt gcaagaacta 5820

gtggagacag agcgtgacta tgtgcgggac cttggctatg tggttgaggg ctacatggca 5880 

cttatgaaag aagatggtgt tcctgatgac atgaaaggaa aagacaaaat tgtgttcggc 5940 

aacatccatc agatttacga ctggcacaga gacttttttt taggagagtt agagaagtgc 6000 

cttgaagatc cagaaaaact aggatccctt tttgttaaac acgagagaag gttgcacatg 6060 

tacatagctt attgtcaaaa taaaccaaag tctgagcaca ttgtctcaga atacattgat 6120

accttttttg aggacttaaa gcagcgtctt ggccacaggt tacagctcac agatctgttg 6180 

atcaaaccag tgcagagaat catgaagtat cagctgttac tgaaggactt cctcaagtat 6240 

tccaaaaagg ccagcctgga tacatcagaa ttagagagag ctgtggaagt catgtgcata 6300 

gtacccaggc ggtgcaacga catgatgaac gtggggcggc tgcaaggatt cgacgggaaa 6360 

atcgttgccc agggtaaact gctcttgcag gacacattct tggtcacaga ccaagatgca 6420 
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ggacttctgc ctcgctgcag agagaggcgc atcttcctct ttgagcagat cgtcatattc 6480 

agcgaaccac ttgataaaaa gaagggcttc tccatgccgg gattcctgtt taagaacagt 6540 

atcaaggtga gttgcctttg cctggaggaa aatgtggaaa atgatccctg taaatttgct 6600 

ctgacatcga ggacgggtga cgtggtagag accttcattt tgcattcatc tagtccaagt 6660

gtccggcaaa cttggatcca tgaaatcaac caaattttag aaaaccagcg caatttttta 6720 

aatgccttga catcgccaat cgagtaccag aggaaccaca gcgggggcgg cggcggcggc 6780 

ggcagcgggg cagcggcggg ggtgggggca gcggcggcgg cggggccccc agtggcggca 6840 

gcggccacag tggcggcccc agcagctgcg gcggcgcccc cagcacgagc aggagccggc 6900 

cctcccggat cccccagcct gtccgacacc acccccccgt gctggtctcc tctgcagcct 6960 

cgagccaggc agaggcagac aagatgtcag agtgaaagca gcagcagtag caacatctcc 7020

accatgttgg tgacacacga ttacacggca gtgaaggagg atgagatcaa cgtctaccaa 7080 

ggagaggtcg ttcaaattct ggccagcaac cagcagaaca tgtttctggt gttccgagcc 7140 

gccactgacc agtgccccgc agctgagggc tggattccag gctttgtcct gggccacacc 7200

agtgcagtca tcgtggagaa cccggacggg actctcaaga agtcaacatc ttggcacaca 7260 

gcactccgtt taaggaaaaa atctgagaaa aaagataaag acggcaaaag ggaaggcaag 7320 

ttagagaacg gttatcggaa gtcacgggaa ggactcagca acaaggtatc tgtgaagctt 7380 

ctcaatccca actacattta tgacgttccc ccagaattcg tcattccatt gagtgaggtc 7440 

acgtgtgaga caggggagac cgttgttctt agatgtcgag tctgtggccg ccccaaagcc 7500 

tcaattacct ggaagggccc tgaacacaac accttgaaca acgatggtca ctacagcatc 7560 

tcctacagtg acctgggaga ggccacgctg aagattgtgg gcgtgaccac ggaagatgac 7620 

ggcatctaca cgtgcatcgc tgtcaatgac atgggttcag cctcatcatc ggccagcctg 7680 

agggtcctag gtccagggat ggatgggatc atggtgacct ggaaagacaa ctttgactcc 7740

ttctacagtg aagtggctga gcttggcagg ggcagattct ctgtcgttaa gaaatgtgat 7800 

cagaaaggaa ccaagcgagc agtggccact aagtttgtga acaagaagtt gatgaagcgc 7860

gaccaggtca cccatgagct tggcatcctg cagagcctcc agcaccccct gcttgtcggc 7920 

ctcctcgaca cctttgagac ccccaccagc tacatcctgg tcttagaaat ggctgaccag 7980 

ggtcgcctcc tggactgcgt ggtgcgatgg ggaagcctca ctgaagggaa gatcagggcg 8040

cacctggggg aggttctgga agctgtccgg tacctgcaca actgcaggat agcacacctg 8100

gacctaaagc ctgagaatat cctggtggat gagagtttag ccaagccaac catcaaactg 8160 

gctgactttg gagatgctgt tcagctcaac acgacctact acatccacca gttactgggg 8220

aaccctgaat tcgcagcccc tgaaatcatc ctcgggaacc ctgtctccct gacctcggat 8280 

acgtggagtg ttggagtgct cacatacgta cttcttagtg gcgtgtcccc cttcctggat 8340

gacagtgtgg aagagacctg cctgaacatt tgccgcttag actttagctt cccagatgac 8400 

tactttaaag gagtgagcca gaaggccaag gagttcgtgt gcttcctcct gcaggaggac 8460 

cccgccaagc gtccctcggc tgcgctggcc ctccaggagc agtggctgca ggccggcaac 8520

ggcagaagca cgggcgtcct cgacacgtcc agactgactt ccttcattga gcggcgcaaa 8580

caccagaatg atgttcgacc tatccgtagc attaaaaact ttctgcagag caggcttctg 8640 

cctagagttt gacctatcca gaagttcttt ctcattctct ttcacctgcc aatcagctgt 8700 

taatctgaat tttcaagaga aaacaagcaa acataactga tcagctgccg gtatgttcat 8760 

cgtgtgaaat tgcattccaa gtgagctgtg ctcagcagtg cttggacaca gagctgcaag 8820

ctgcgctggg gtggaggacc gtcacttaca ctctgccaag gacggaggtc gcattgctgt 8880

atcacagtat tttttacgga tttctg 8906 



<210> 665 
<211> 1204 
<212> DNA 

<213> Homo sapiens 
<400> 665 

tcggcggcgg tggtatcggc ggcagctgtg agggggttcc gggaagatgg tgctgatcaa 60 
ggaattccgt gtggttttgc catgttctgt tcaggagtat caggttgggc agctttactc 120
tgttgcagaa gctagtaaga atgagactgg tggtggagaa ggaattgaag tcttaaagaa 180
tgaaccttat gagaaggatg gagaaaaggg acagtatacg cacaaaattt atcacctaaa 240
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gagcaaagtg cctgcattcg 
tgaggatgat tgctcccgag ggctccttgg tgtttcatga 300
gaaagcctgg aatgcgtacc cctactgtag aacaattgta acgaatgaat atatgaaaga 360
tgatttcttc attaaaatcg aaacatggca caaaccagac ttgggaacat tagaaaatgt 420
acatggttta gatccaaaca catggaaaac tgttgaaatt gtccatatag atattgcaga 480
tagaagtcaa gttgaaccag cagactacaa agctgatgaa gacccagcat tattccagtc 540
agtcaagacc aagagaggcc ctttgggacc caactggaag aaggagctgg caaacagccc 600
tgactgtccc cagatgtgtg cctataagct ggtgaccatc aaattcaagt ggtggggact 660
gcaaagcaaa gtagaaaact tcattcaaaa gcaagaaaaa cggatattta caaacttcca 720
tcgccagctt ttttgttgga ttgacaagtg gatcgatctc acgatggaag acattaggag 780
aatggaagac gagactcaga aagaactaga aacaatgcgt aagaggggtt ccgttcgagg 840
cacgtcggct gctgatgtct agatgagtcc cctgtagggt cagagacaat gtcaaactgt 900
ttacgtaatc aaggtcaagt gaggggaaca agcgcagcca gtgatgagtg aacaacaatc 960
tgaccagtat cttgcagtgt tgacgtttcc cagatgtgtg cttgtgatga tacacacaca 1020
tgcacaggtt ctcaaccacg tgtgtatata tgtatgtgtg catatgtctg tagctgtata 1080
taaagcgcat gtagagctac agatccagat acacacactt gtgtatatat gtacatacag 1140
acatactgaa gggattagta caatttctcc aaagtactgt acctatcttc agcaagaatg 1200
caaa 1204

<210> 666
<211> 892
<212> PRT
<213> Homo sapiens

<400> 666
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Tyr Val Ser Ser Phe Tyr His Ala Phe Ser Gly Ala Gin Lys Ala Glu 

245 250 255

Thr Ala Ala Asn Arg Ile Cys Lys Val Leu Ala Val Asn Gln Glu Asn

260 265 270

Glu Gln Leu Met Glu Asp Tyr Glu Lys Leu Ala Ser Asp Leu Leu Glu

275 280 285

Trp Ile Arg Arg Thr Ile Pro Trp Leu Glu Asn Arg Val Pro Glu Asn

290 295 300

Thr Met His Ala Met Gln Gln Lys Leu Glu Asp Phe Arg Asp Tyr Arg
305 310 315 320

Arg Leu His Lys Pro Pro Lys Val Gln Glu Lys Cys Gln Leu Glu Ile

325 330 335

Asn Phe Asn Thr Leu Gln Thr Lys Leu Arg Leu Ser Asn Arg Pro Ala

340 345 350

Phe Met Pro Ser Glu Gly Arg Met Val Ser Asp Ile Asn Asn Ala Trp

355 360 365

Gly Cys Leu Glu Gln Val Glu Lys Gly Tyr Glu Glu Trp Leu Leu Asn

370 375 380

Glu Ile Arg Arg Leu Glu Arg Leu Asp His Leu Ala Glu Lys Phe Arg
385 390 395 400

Gln Lys Ala Ser Ile His Glu Ala Trp Thr Asp Gly Lys Glu Ala Met

405 410 415

Leu Arg Gln Lys Asp Tyr Glu Thr Ala Thr Leu Ser Glu Ile Lys Ala

420 425 430

Leu Leu Lys Lys His Glu Ala Phe Glu Ser Asp Leu Ala Ala His Gln

435 440 445

Asp Arg Val Glu Gln Ile Ala Ala Ile Ala Gln Glu Leu Asn Glu Leu

450 455 460

Asp Tyr Tyr Asp Ser Pro Ser Val Asn Ala Arg Cys Gln Lys Ile Cys
465 470 475 480

Asp Gln Trp Asp Asn Leu Gly Ala Leu Thr Gln Lys Arg Arg Glu Ala

485 490 495

Leu Glu Arg Thr Glu Lys Leu Leu Glu Thr Ile Asp Gln Leu Tyr Leu

500 505 510

Glu Tyr Ala Lys Arg Ala Ala Pro Phe Asn Asn Trp Met Glu Gly Ala

515 520 525

Met Glu Asp Leu Gln Asp Thr Phe Ile Val His Thr Ile Glu Glu Ile

530 535 540

Gln Gly Leu Thr Thr Ala His Glu Gln Phe Lys Ala Thr Leu Pro Asp
545 550 555 560

Ala Asp Lys Glu Arg Leu Ala Ile Leu Gly Ile His Asn Glu Val Ser
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35 40 45 

Lys Leu Ala Ser Glu Lys Ser Glu Met Gln Arg His Tyr Val Met Tyr

50 55 60

Tyr Glu Met Ser Tyr Gly Leu Asn Ile Glu Met His Lys Gln Ala Glu
65 70 75 80

Ile Val Lys Arg Leu Asn Gly Ile Cys Ala Gln Val Leu Pro Tyr Leu

85 90 95
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115 120 125 
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<213> Homo sapiens 
<400> 674 

Met Trp Arg Val Lys Lys Leu Ser Leu Ser Leu Ser Pro Ser Pro Gln

1 5 10 15

Thr Gly Lys Pro Ser Met Arg Thr Pro Leu Arg Glu Leu Thr Leu Gln

20 25 30

Pro Gly Ala Leu Thr Thr Ser Gly Lys Arg Ser Pro Ala Cys Ser Ser

35 40 45

Leu Thr Pro Ser Leu Cys Lys Leu Gly Leu Gln Glu Gly Ser Asn Asn

50 55 60

Ser Ser Pro Val Asp Phe Val Asn Asn Lys Arg Thr Asp Leu Ser Ser
65 70 75 80

Glu His Phe Ser His Ser Ser Lys Trp Leu Glu Thr Cys Gln His Glu

85 90 95

Ser Asp Glu Gln Pro Leu Asp Pro Ile Pro Gln Ile Ser Ser Thr Pro

100 105 110

Lys Thr Ser Glu Glu Ala Val Asp Pro Leu Gly Asn Tyr Met Val Lys

115 120 125

Thr Ile Val Leu Val Pro Ser Pro Leu Gly Gln Gln Gln Asp Met Ile

130 135 140

Phe Glu Ala Arg Leu Asp Thr Met Ala Glu Thr Asn Ser Ile Ser Leu
145 150 155 160

Asn Gly Pro Leu Arg Thr Asp Asp Leu Val Arg Glu Glu Val Ala Pro

165 170 175

Cys Met Gly Asp Arg Phe Ser Glu Val Ala Ala Val Ser Glu Lys Pro

180 185 190

Ile Phe Gln Glu Ser Pro Ser His Leu Leu Glu Glu Ser Pro Pro Asn

195 200 205

Pro Cys Ser Glu Gln Leu His Cys Ser Lys Glu Ser Leu Ser Ser Arg

210 215 220

Thr Glu Ala Val Arg Glu Asp Leu Val Pro Ser Glu Ser Asn Ala Phe
225 230 235 240

Leu Pro Ser Ser Val Leu Trp Leu Ser Pro Ser Thr Ala Leu Ala Ala

245 250 255

Asp Phe Arg Val Asn His Val Asp Pro Glu Glu Glu Ile Val Glu His

260 265 270

Gly Ala Met Glu Glu Arg Glu Met Arg Phe Pro Thr His Pro Lys Glu

275 280 285

Ser Glu Thr Glu Asp Gln Ala Leu Val Ser Ser Val Glu Asp Ile Leu

290 295 300

Ser Thr Cys Leu Thr Pro Asn Leu Val Glu Met Glu Ser Gln Glu Ala
305 310 315 320

Pro Gly Pro Ala Val Glu Asp Val Gly Arg Ile Leu Gly Ser Asp Thr

325 330 335

Glu Ser Trp Met Ser Pro Leu Ala Trp Leu Glu Lys Gly Val Asn Thr

340 345 350

Ser Val Met Leu Glu Asn Leu Arg Gln Ser Leu Ser Leu Pro Ser Met

355 360 365

Leu Arg Asp Ala Ala Ile Gly Thr Thr Pro Phe Ser Thr Cys Ser Val

370 375 380

Gly Thr Trp Phe Thr Pro Ser Ala Pro Gln Glu Lys Ser Thr Asn Thr
385 390 395 400

Ser Gln Thr Gly Leu Val Gly Thr Lys His Ser Thr Ser Glu Thr Glu
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Lys 


Gin 








6R 5 

O O J 








Car 
□ Cl 


Ala 


Gin 


Leu 


Glu 


Glu 


Cys 






700 

/ L/ W 










Glu 


Asn 


lie 


Atct 


Leu 


Ala 


Thr 




71 5 
/ i j 










720 


Al 3 


Asn 


riC L» 


Asp 


Ser 


Gin 


Leu 


7^0 










7*^5 






Ala 


Gln 


Asp 


Leu 


Ala 


Met 










750 






Gin 


Ser 


Asn 


Glu 


Glu 


Gin 


Ala 








765 








ilCU 


xjy o 


XIX o 


nc L- 


m n 

w-LXX 


Ala 


VJ -L LI 






7ft 0 










Lys 


Glu 


Val 




Asp 


Leu 


Lys 




795 










800 


Asn 


Gin 


Val 


Ala 


His 


Leu 


Glu 


810 










815 




Thr 


Thr 


Leu 


Glu 


Val 


Leu 


Arg 










830 






Lys 


Asp 


Thr 


Val 


Glu 


Asn 


Leu 
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835 










840 










845 








Thr Ala Lys Leu Ala Ser Thr Ile Ala Asp Asn Gln Glu Gln Asp Leu

850 855 860

Glu Lys Thr Arg Gln Tyr Ser Gln Lys Leu Gly Leu Leu Thr Glu Gln
865 870 875 880

Leu Gln Ser Leu Thr Leu Phe Leu Gln Thr Lys Leu Lys Glu Lys Thr

885 890 895

Glu Gln Glu Thr Leu Leu Leu Ser Thr Ala Cys Pro Pro Thr Gln Glu

900 905 910

His Pro Leu Pro Asn Asp Arg Thr Phe Leu Gly Ser Ile Leu Thr Ala

915 920 925

Val Ala Asp Glu Glu Pro Glu Ser Thr Pro Val Pro Leu Leu Gly Ser

930 935 940

Asp Lys Ser Ala Phe Thr Arg Val Ala Ser Met Val Ser Leu Gln Pro
945 950 955 960

Ala Glu Thr Pro Gly Met Glu Glu Ser Leu Ala Glu Met Ser Ile Met

965 970 975

Thr Thr Glu Leu Gln Ser Leu Cys Ser Leu Leu Gln Glu Ser Lys Glu

980 985 990

Glu Ala Ile Arg Thr Leu Gln Arg Lys Ile Cys Glu Leu Gln Ala Arg

995 1000 1005

Leu Gln Ala Gln Glu Glu Gln His Gln Glu Val Gln Lys Ala Lys Glu

1010 1015 1020

Ala Asp Ile Glu Lys Leu Asn Gln Ala Leu Cys Leu Arg Tyr Lys Asn
1025 1030 1035 1040

Glu Lys Glu Leu Gln Glu Val Ile Gln Gln Asn Glu Lys Ile Leu Glu

1045 1050 1055

Gln Ile Asp Lys Ser Gly Glu Leu Ile Ser Leu Arg Glu Glu Val Thr

1060 1065 1070

His Leu Thr Arg Ser Leu Arg Arg Ala Glu Thr Glu Thr Lys Val Leu

1075 1080 1085

Gln Glu Ala Trp Gln Ala Ser Trp Thr Pro Thr Ala Ser Leu Trp Pro

1090 1095 1100

Pro Ile Gly Ser Arg Arg Lys Cys Gly Ser Leu Arg Arg Trp Thr Asn
1105 1110 1115 1120


<210> 675 




























<211> 540 




























<212> PRT 




























<213> Homo sapiens 
























<400> 675 




























Met Gly Thr Thr Ala Arg Ala Ala Leu Val Leu Thr Tyr Leu Ala Val

1 5 10 15

Ala Ser Ala Ala Ser Glu Gly Gly Phe Thr Ala Thr Gly Gln Arg Gln

20 25 30

Leu Arg Pro Glu His Phe Gln Glu Val Gly Tyr Ala Ala Pro Pro Ser

35 40 45

Pro Pro Leu Ser Arg Ser Leu Pro Met Asp His Pro Asp Ser Ser Gln

50 55 60

His Gly Pro Pro Phe Glu Gly Gln Ser Gln Val Gln Pro Pro Pro Ser

65 70 75 80

Gln Glu Ala Thr Pro Leu Gln Gln Glu Lys Leu Leu Pro Ala Gln Leu

85 90 95



-233- 



WO 02/092000 



PCT/US02/14877 



Pro 


Ala 


Glu 


Lys 
i oo 


Glu 


Val 


Gly 


Pro 


Pro 

X \J -J 


Leu 


Pro 


Gin 


Glu 


Ala 
110 


Val 


Pro 


ucu 




Lys 

lie; 
x J-O 


VJ X u. 


Leu 




Ser 


Leu 
i 7 n 


Gin 


His 


Pro 


Asn 


Glu 
1 25 


Gin 


Lvs 


Glu 


Gly 


Thr 

Tin 
1 j u 


Pro 


Ala 


Pro 


Phe 


Gly 

1 

Xj J 


Asp 


Gin 


Ser 


His 


Pro 

140 


Glu 


Pro 


Glu 


Ser 


Trp 


Asn 


Ala 


Ala 


Gin 


His 


Cys 


Gin 


Gin 


Asp 


Arg 


Ser 


Gin 


Gly 


Gly 


Trp 


1 AC 










1 C ft 










1 cc 

X -J Zj 










160 


Gly 


His 


Arg 


Leu 


Asp 

X \J ZJ 


Gly 


Phe 


Pro 


Pro 


Gly 
170 


Arg 


Pro 


Ser 


Pro 


Asp 
175 


Asn 


Leu 


Asn 


Gin 


He 

1 ft D 


Cys 


Leu 


Pro 


Asn 


Arg 
185 


Gin 


His 


Val 


Val 


Tyr 
190 


Gly 


Pro 








.t X w 


Gin 


Ser 


Ser 


Tvr 

y 

700 
\j \j 


Ser 


His 


Leu 


Thr 


Arcr 
205 


Gin 


Gly 


Glu 


Thr 


Leu 

ZlU 


Asn 


Phe 


Leu 


Glu 


He 

7 1 c 


Gly 


Tyr 


Ser 


Arg 


Cys 

77 0 


Cys 


His 


Cys 


Arg 


Ser 


His 


Thr 


Asn 


Arg 


Leu 


Glu 


Cys 


Ala 


Lys 


Leu 


Val 


Trp 


Glu 


Glu 


Ala 


— \ o c 
ZAD 




















7 "X 
£* 3 D 










240 


Met 


Ser 


Arg 


Phe 


Cys 

74 *Z 


Glu 


Ala 


Glu 


Phe 


Ser 
250 


Val 


Lys 


Thr 


Arg 


Pro 
255 


His 


Trp 


Cys 


Cys 


Thr 
7 £ n 

Z D U 


Arg 


Gin 


Gly 


Glu 


Ala 
265 


Arg 


Phe 


Ser 


Cys 


Phe 
270 


Gin 


Glu 


vJJX U. 




JTi. vJ 

OIK 


m ti 


-tr J- w 


His 


xyx 


Gin 
o r n 


Leu 


Aro 


Ala 


Cvs 


Pro 
285 


Ser 


His 


Gin 


Pro 


Asp 

o q n 

a y U 


He 


Ser 


Ser 


Gly 


Leu 

7 Q 


Glu 


Leu 


Pro 


Phe 


Pro 

7. 0 n 


Pro 


Gly 


Val 


Pro 


Thr 


Leu 


Asp 


Asn 


He 


Lys 


Asn 


He 


Cys 


His 


Leu 


Arg 


Arg 


Phe 


Arg 


Ser 


*3 n tr 
3 05 






























32 0 


Val 


Pro 


Arg 


Asn 


Leu 


Pro 


Ala 


Thr 


Asp 


Pro 
j j \j 


Leu 


Gin 


Arg 


Glu 


Leu 
«-> —? 


Leu 


Ala 


Leu 


He 


Gin 
*xa n 


Leu 


Glu 


Arg 


Glu 


Phe 


Gin 


Arg 


Cys 


Cys 


Arg 
350 


Gin 


Gly 


7\ en 

Hbll 


7\ c n 


n x o 


Thr* 
j. nx 




Thr 




Lys 

J D U 


Ala 


Trn 

-Lip 


Glu 


Asp 


Thr 
365 


Leu 


Asp 


Lys 


Tyr 


Cys 
t *7 n 


Asp 


Arg 


Glu 


Tyr 


Ala 

J / D 


Val 


Lys 


Thr 


His 


His 

-> 0 \j 


His 


Leu 


Cys 


Cys 


Arg 


His 


Pro 


Pro 


Ser 


Pro 


Thr 


Arg 


Asp 


Glu 


Cys 


Phe 


Ala 


Arg 


Arg 


Ala 


*a q c 
job 










■son 

J J7 U 




















400 


Pro 


Tyr 


Pro 


Asn 


Tyr 
*± u s 


Asp 


Arg 


Asp 


He 


Leu 
41 0 

*± x VJ 


Thr 


He 


Asp 


He 


Ser 

41 R 

*± X ZJ 


Arg 


Val 


Thr 


Pro 


Asn 
47 0 

*fc U 


Leu 


Met 


Gly 


His 


Leu 


Cys 


Gly 


Asn 


Gin 


Arg 
430 


Val 


Leu 


TVl T" 


uy o 


n x o 
4 *3 R 


TiV«; 

XJjr «3 


His 


He 




Gly 


Leu 


He 


His 


Asn 


Met 
445 


Thr 


Ala 


Arcr 


Cys 


Cys 
/en 

ft D U 


Asp 


Leu 


Pro 


Phe 


Pro 

*± D -3 


Glu 


Gin 


Ala 


Cys 


Cys 

TtOU 


Ala 


Glu 


Glu 


Glu 


Lys 


Leu 


Thr 


Phe 


He 


Asn 


Asp 


Leu 


Cys 


Gly 


Pro 


Arg 


Arg 


Asn 


He 


Trp 


4 bb 










a 7 n 




















480 


Arg 


Asp 


Pro 


Ala 


Leu 

A Q C 

4ob 


Cys 


Cys 


Tyr 


Leu 


Ser 


Pro 


Gly 


Asp 


Glu 


Gin 

*± J? Zj 


Val 


Asn 


Cys 


Phe 


Asn 
500 


He 


Asn 


Tyr 


Leu 


Arg 
505 


Asn 


Val 


Ala 


Leu 


Val 
510 


Ser 


Gly 


Asp 


Thr 


Glu 
515 


Asn 


Ala 


Lys 


Gly 


Gin 
520 


Gly 


Glu 


Gin 


Gly 


Ser 
525 


Thr 


Gly 


Gly 






Thr Asn Ile Ser Ser Thr Ser Glu Pro Lys Glu Glu

530 535 540



<210> 676 
<211> 462 
<212> PRT 
<213> Homo sapiens 

<400> 676 

Met Gly Lys Glu Lys Thr His Ile Asn Ile Val Val Ile Gly His Val

1 5 10 15

Asp Ser Gly Lys Ser Thr Thr Thr Gly His Leu Ile Tyr Lys Cys Gly

20 25 30

Gly Ile Asp Lys Arg Thr Ile Glu Lys Phe Glu Lys Glu Ala Ala Glu

35 40 45

Met Gly Lys Gly Ser Phe Lys Tyr Ala Trp Val Leu Asp Lys Leu Lys

50 55 60

Ala Glu Arg Glu Arg Gly Ile Thr Ile Asp Ile Ser Leu Trp Lys Phe

65 70 75 80

Glu Thr Ser Lys Tyr Tyr Val Thr Ile Ile Asp Ala Pro Gly His Arg

85 90 95

Asp Phe Ile Lys Asn Met Ile Thr Gly Thr Ser Gln Ala Asp Cys Ala

100 105 110

Val Leu Ile Val Ala Ala Gly Val Gly Glu Phe Glu Ala Gly Ile Ser

115 120 125

Lys Asn Gly Gln Thr Arg Glu His Ala Leu Leu Ala Tyr Thr Leu Gly

130 135 140

Val Lys Gln Leu Ile Val Gly Val Asn Lys Met Asp Ser Thr Glu Pro

145 150 155 160

Pro Tyr Ser Gln Lys Arg Tyr Glu Glu Ile Val Lys Glu Val Ser Thr

165 170 175

Tyr Ile Lys Lys Ile Gly Tyr Asn Pro Asp Thr Val Ala Phe Val Pro

180 185 190

Ile Ser Gly Trp Asn Gly Asp Asn Met Leu Glu Pro Ser Ala Asn Met

195 200 205

Pro Trp Phe Lys Gly Trp Lys Val Thr Arg Lys Asp Gly Asn Ala Ser

210 215 220

Gly Thr Thr Leu Leu Glu Ala Val Asp Cys Ile Leu Pro Pro Thr Arg

225 230 235 240

Pro Thr Asp Lys Pro Leu Arg Leu Pro Leu Gln Asp Val Tyr Lys Ile

245 250 255

Gly Gly Ile Gly Thr Val Pro Val Gly Arg Val Glu Thr Gly Val Leu

260 265 270

Lys Pro Gly Met Val Val Thr Phe Ala Pro Val Asn Val Thr Thr Glu

275 280 285

Val Lys Ser Val Glu Met His His Glu Ala Leu Ser Glu Ala Leu Pro

290 295 300

Gly Asp Asn Val Gly Phe Asn Val Lys Asn Val Ser Val Lys Asp Val

305 310 315 320

Arg Arg Gly Asn Val Ala Gly Asp Ser Lys Asn Asp Pro Pro Met Glu

325 330 335

Ala Ala Gly Phe Thr Ala Gln Val Ile Ile Leu Asn His Pro Gly Gln

340 345 350

Ile Ser Ala Gly Tyr Ala Pro Val Leu Asp Cys His Thr Ala His Ile
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355 360 365 



Ala Cys Lys Phe Ala Glu Leu Lys Glu Lys Ile Asp Arg Arg Ser Gly

370 375 380

Lys Lys Leu Glu Asp Gly Pro Lys Phe Leu Lys Ser Gly Asp Ala Ala

385 390 395 400

Ile Val Asp Met Val Pro Gly Lys Pro Met Cys Val Glu Ser Phe Ser

405 410 415

Asp Tyr Pro Pro Leu Gly Arg Phe Ala Val Arg Asp Met Arg Gln Thr

420 425 430

Val Ala Val Gly Val Ile Lys Ala Val Asp Lys Lys Ala Ala Gly Ala

435 440 445

Gly Lys Val Thr Lys Ser Ala Gln Lys Ala Gln Lys Ala Lys

450 455 460










<210> 677 




























<211> 2328 




























<212> PRT 




























<213> Homo sapiens 
























<400> 677 




























Lys Ser Lys Arg Gln Ala Gln Gln Met Val Gln Pro Gln Ser Pro Val

1 5 10 15

Ala Val Ser Gln Ser Lys Pro Gly Cys Tyr Asp Asn Gly Lys His Tyr

20 25 30

Gln Ile Asn Gln Gln Trp Glu Arg Thr Tyr Leu Gly Asn Val Leu Val

35 40 45

Cys Thr Cys Tyr Gly Gly Ser Arg Gly Phe Asn Cys Glu Ser Lys Pro

50 55 60

Glu Ala Glu Glu Thr Cys Phe Asp Lys Tyr Thr Gly Asn Thr Tyr Arg

65 70 75 80

Val Gly Asp Thr Tyr Glu Arg Pro Lys Asp Ser Met Ile Trp Asp Cys

85 90 95

Thr Cys Ile Gly Ala Gly Arg Gly Arg Ile Ser Cys Thr Ile Ala Asn

100 105 110

Arg Cys His Glu Gly Gly Gln Ser Tyr Lys Ile Gly Asp Thr Trp Arg

115 120 125

Arg Pro His Glu Thr Gly Gly Tyr Met Leu Glu Cys Val Cys Leu Gly

130 135 140

Asn Gly Lys Gly Glu Trp Thr Cys Lys Pro Ile Ala Glu Lys Cys Phe

145 150 155 160

Asp His Ala Ala Gly Thr Ser Tyr Val Val Gly Glu Thr Trp Glu Lys

165 170 175

Pro Tyr Gln Gly Trp Met Met Val Asp Cys Thr Cys Leu Gly Glu Gly

180 185 190

Ser Gly Arg Ile Thr Cys Thr Ser Arg Asn Arg Cys Asn Asp Gln Asp

195 200 205

Thr Arg Thr Ser Tyr Arg Ile Gly Asp Thr Trp Ser Lys Lys Asp Asn

210 215 220

Arg Gly Asn Leu Leu Gln Cys Ile Cys Thr Gly Asn Gly Arg Gly Glu

225 230 235 240

Trp Lys Cys Glu Arg His Thr Ser Val Gln Thr Thr Ser Ser Gly Ser

245 250 255

Gly Pro Phe Thr Asp Val Arg Ala Ala Val Tyr Gln Pro Gln Pro His

260 265 270

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Pro Gin Pro Pro Pro Tyr Gly His Cys Val Thr Asp Ser Gly Val Val 

275 280 285 

Tyr Ser Val Gly Met Gln Trp Leu Lys Thr Gln Gly Asn Lys Gln Met

290 295 300

Leu Cys Thr Cys Leu Gly Asn Gly Val Ser Cys Gln Glu Thr Ala Val

305 310 315 320

Thr Gln Thr Tyr Gly Gly Asn Leu Asn Gly Glu Pro Cys Val Leu Pro

325 330 335

Phe Thr Tyr Asn Gly Arg Thr Phe Tyr Ser Cys Thr Thr Glu Gly Arg

340 345 350

Gln Asp Gly His Leu Trp Cys Ser Thr Thr Ser Asn Tyr Glu Gln Asp

355 360 365

Gln Lys Tyr Ser Phe Cys Thr Asp His Thr Val Leu Val Gln Thr Gln

370 375 380

Gly Gly Asn Ser Asn Gly Ala Leu Cys His Phe Pro Phe Leu Tyr Asn

385 390 395 400

Asn His Asn Tyr Thr Asp Cys Thr Ser Glu Gly Arg Arg Asp Asn Met

405 410 415

Lys Trp Cys Gly Thr Thr Gln Asn Tyr Asp Ala Asp Gln Lys Phe Gly

420 425 430

Phe Cys Pro Met Ala Ala His Glu Glu Ile Cys Thr Thr Asn Glu Gly

435 440 445

Val Met Tyr Arg Ile Gly Asp Gln Trp Asp Lys Gln His Asp Met Gly

450 455 460

His Met Met Arg Cys Thr Cys Val Gly Asn Gly Arg Gly Glu Trp Thr

465 470 475 480

Cys Ile Ala Tyr Ser Gln Leu Arg Asp Gln Cys Ile Val Asp Asp Ile

485 490 495

Thr Tyr Asn Val Asn Asp Thr Phe His Lys Arg His Glu Glu Gly His

500 505 510

Met Leu Asn Cys Thr Cys Phe Gly Gln Gly Arg Gly Arg Trp Lys Cys

515 520 525

Asp Pro Val Asp Gln Cys Gln Asp Ser Glu Thr Gly Thr Phe Tyr Gln

530 535 540

Ile Gly Asp Ser Trp Glu Lys Tyr Val His Gly Val Arg Tyr Gln Cys

545 550 555 560

Tyr Cys Tyr Gly Arg Gly Ile Gly Glu Trp His Cys Gln Pro Leu Gln

565 570 575

Thr Tyr Pro Ser Ser Ser Gly Pro Val Glu Val Phe Ile Thr Glu Thr

580 585 590

Pro Ser Gln Pro Asn Ser His Pro Ile Gln Trp Asn Ala Pro Gln Pro

595 600 605

Ser His Ile Ser Lys Tyr Ile Leu Arg Trp Arg Pro Lys Asn Ser Val

610 615 620

Gly Arg Trp Lys Glu Ala Thr Ile Pro Gly His Leu Asn Ser Tyr Thr

625 630 635 640

Ile Lys Gly Leu Lys Pro Gly Val Val Tyr Glu Gly Gln Leu Ile Ser

645 650 655

Ile Gln Gln Tyr Gly His Gln Glu Val Thr Arg Phe Asp Phe Thr Thr

660 665 670

Thr Ser Thr Ser Thr Pro Val Thr Ser Asn Thr Val Thr Gly Glu Thr

675 680 685

Thr Pro Phe Ser Pro Leu Val Ala Thr Ser Glu Ser Val Thr Glu Ile

690 695 700






Thr Ala Ser Ser Phe Val Val Ser Trp Val Ser Ala Ser Asp Thr Val

705 710 715 720

Ser Gly Phe Arg Val Glu Tyr Glu Leu Ser Glu Glu Gly Asp Glu Pro

725 730 735

Gln Tyr Leu Asp Leu Pro Ser Thr Ala Thr Ser Val Asn Ile Pro Asp

740 745 750

Leu Leu Pro Gly Arg Lys Tyr Ile Val Asn Val Tyr Gln Ile Ser Glu

755 760 765

Asp Gly Glu Gln Ser Leu Ile Leu Ser Thr Ser Gln Thr Thr Ala Pro

770 775 780










Asp 


-rt. 1 Ct 




Pro 




Pro 


Thr 


Val 


Asp 


Gin 


Val 


Asp 


Asp 


Thr 


Ser 


He 


7 R5 

/ O -J 










790 










795 










800 


Val Val Arg Trp Ser Arg Pro Gln Ala Pro Ile Thr Gly Tyr Arg Ile

805 810 815




Val 




Ser 


Pro 


Ser 


Val 


Glu 


Glv 


Ser 


Ser 


Thr 


Glu 


Leu 


Asn 


Leu 


Pro 








820 










825 










830 






Glu Thr Ala Asn Ser Val Thr Leu Ser Asp Leu Gln Pro Gly Val Gln

835 840 845








lyi 




He 


Thr 

X HI 


He 


Tvr 
iy i 


Ala 


Val 


Glu 


Glu 


Asn 


Gin 


Glu 


Ser 


Thr 


Pro 














ft 5 5 










860 










V CJ.1 


Va 1 

V Ct 1 


Tie 


m n 

Olil 


ft! n 


Glu 


Thr 


Thr 


Gly 


Thr 


Pro 




Ser 


Asp 


Thr 


Val 


Q c; 
ODD 










R 7 0 










R75 

O / Zj 










880 




Op-r 
O Ci. 


Jr 1 w 


Arg 


Asp 


Leu 


Gin 


Phe 


Val 


Glu 


Val 


Thr 


Asp 


Val 


Lys 


Val 
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Glu Thr Thr He Val He Thr Trp Thr Pro Ala Pro Arg He Gly Phe 

1075 1080 1085 

Lys Leu Gly Val Arg Pro Ser Gln Gly Gly Glu Ala Pro Arg Glu Val

1090 1095 1100

Thr Ser Asp Ser Gly Ser Ile Val Val Ser Gly Leu Thr Pro Gly Val

1105 1110 1115 1120

Glu Tyr Val Tyr Thr Ile Gln Val Leu Arg Asp Gly Gln Glu Arg Asp

1125 1130 1135
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Ala Pro lie Val Asn Lys Val Val Thr Pro Leu Ser Pro Pro Thr Asn 

1140 1145 1150 

Leu His Leu Glu Ala Asn Pro Asp Thr Gly Val Leu Thr Val Ser Trp

1155 1160 1165

Glu Arg Ser Thr Thr Pro Asp Ile Thr Gly Tyr Arg Ile Thr Thr Thr

1170 1175 1180

Pro Thr Asn Gly Gln Gln Gly Asn Ser Leu Glu Glu Val Val His Ala

1185 1190 1195 1200

Asp Gln Ser Ser Cys Thr Phe Asp Asn Leu Ser Pro Gly Leu Glu Tyr

1205 1210 1215

Asn Val Ser Val Tyr Thr Val Lys Asp Asp Lys Glu Ser Val Pro Ile

1220 1225 1230

Ser Asp Thr Ile Ile Pro Ala Val Pro Pro Pro Thr Asp Leu Arg Phe

1235 1240 1245

Thr Asn Ile Gly Pro Asp Thr Met Arg Val Thr Trp Ala Pro Pro Pro

1250 1255 1260

Ser Ile Asp Leu Thr Asn Phe Leu Val Arg Tyr Ser Pro Val Lys Asn

1265 1270 1275 1280

Glu Glu Asp Val Ala Glu Leu Ser Ile Ser Pro Ser Asp Asn Ala Val

1285 1290 1295

Val Leu Thr Asn Leu Leu Pro Gly Thr Glu Tyr Val Val Ser Val Ser

1300 1305 1310

Ser Val Tyr Glu Gln His Glu Ser Thr Pro Leu Arg Gly Arg Gln Lys

1315 1320 1325

Thr Gly Leu Asp Ser Pro Thr Gly Ile Asp Phe Ser Asp Ile Thr Ala

1330 1335 1340

Asn Ser Phe Thr Val His Trp Ile Ala Pro Arg Ala Thr Ile Thr Gly

1345 1350 1355 1360

Tyr Arg Ile Arg His His Pro Glu His Phe Ser Gly Arg Pro Arg Glu

1365 1370 1375

Asp Arg Val Pro His Ser Arg Asn Ser Ile Thr Leu Thr Asn Leu Thr

1380 1385 1390

Pro Gly Thr Glu Tyr Val Val Ser Ile Val Ala Leu Asn Gly Arg Glu

1395 1400 1405

Glu Ser Pro Leu Leu Ile Gly Gln Gln Ser Thr Val Ser Asp Val Pro

1410 1415 1420

Arg Asp Leu Glu Val Val Ala Ala Thr Pro Thr Ser Leu Leu Ile Ser

1425 1430 1435 1440

Trp Asp Ala Pro Ala Val Thr Val Arg Tyr Tyr Arg Ile Thr Tyr Gly

1445 1450 1455

Glu Thr Gly Gly Asn Ser Pro Val Gln Glu Phe Thr Val Pro Gly Ser

1460 1465 1470

Lys Ser Thr Ala Thr Ile Ser Gly Leu Lys Pro Gly Val Asp Tyr Thr

1475 1480 1485

Ile Thr Val Tyr Ala Val Thr Gly Arg Gly Asp Ser Pro Ala Ser Ser

1490 1495 1500

Lys Pro Ile Ser Ile Asn Tyr Arg Thr Glu Ile Asp Lys Pro Ser Gln

1505 1510 1515 1520

Met Gln Val Thr Asp Val Gln Asp Asn Ser Ile Ser Val Lys Trp Leu

1525 1530 1535

Pro Ser Ser Ser Pro Val Thr Gly Tyr Arg Val Thr Thr Thr Pro Lys

1540 1545 1550

Asn Gly Pro Gly Pro Thr Lys Thr Lys Thr Ala Gly Pro Asp Gln Thr

1555 1560 1565
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Glu Met Thr He Glu Gly Leu Gin Pro Thr Val Glu Tyr Val Val Ser 

1570 1575 1580 

Val Tyr Ala Gln Asn Pro Ser Gly Glu Ser Gln Pro Leu Val Gln Thr

1585 1590 1595 1600

Ala Val Thr Asn Ile Asp Arg Pro Lys Gly Leu Ala Phe Thr Asp Val

1605 1610 1615

Asp Val Asp Ser Ile Lys Ile Ala Trp Glu Ser Pro Gln Gly Gln Val

1620 1625 1630

Ser Arg Tyr Arg Val Thr Tyr Ser Ser Pro Glu Asp Gly Ile His Glu

1635 1640 1645

Leu Phe Pro Ala Pro Asp Gly Glu Glu Asp Thr Ala Glu Leu Gln Gly

1650 1655 1660

Leu Arg Pro Gly Ser Glu Tyr Thr Val Ser Val Val Ala Leu His Asp

1665 1670 1675 1680

Asp Met Glu Ser Gln Pro Leu Ile Gly Thr Gln Ser Thr Ala Ile Pro

1685 1690 1695

Ala Pro Thr Asp Leu Lys Phe Thr Gln Val Thr Pro Thr Ser Leu Ser

1700 1705 1710

Ala Gln Trp Thr Pro Pro Asn Val Gln Leu Thr Gly Tyr Arg Val Arg

1715 1720 1725

Val Thr Pro Lys Glu Lys Thr Gly Pro Met Lys Glu Ile Asn Leu Ala

1730 1735 1740

Pro Asp Ser Ser Ser Val Val Val Ser Gly Leu Met Val Ala Thr Lys

1745 1750 1755 1760

Tyr Glu Val Ser Val Tyr Ala Leu Lys Asp Thr Leu Thr Ser Arg Pro

1765 1770 1775

Ala Gln Gly Val Val Thr Thr Leu Glu Asn Val Ser Pro Pro Arg Arg

1780 1785 1790

Ala Arg Val Thr Asp Ala Thr Glu Thr Thr Ile Thr Ile Ser Trp Arg

1795 1800 1805

Thr Lys Thr Glu Thr Ile Thr Gly Phe Gln Val Asp Ala Val Pro Ala

1810 1815 1820

Asn Gly Gln Thr Pro Ile Gln Arg Thr Ile Lys Pro Asp Val Arg Ser

1825 1830 1835 1840

Tyr Thr Ile Thr Gly Leu Gln Pro Gly Thr Asp Tyr Lys Ile Tyr Leu

1845 1850 1855

Tyr Thr Leu Asn Asp Asn Ala Arg Ser Ser Pro Val Val Ile Asp Ala

1860 1865 1870

Ser Thr Ala Ile Asp Ala Pro Ser Asn Leu Arg Phe Leu Ala Thr Thr

1875 1880 1885

Pro Asn Ser Leu Leu Val Ser Trp Gln Pro Pro Arg Ala Arg Ile Thr

1890 1895 1900

Gly Tyr Ile Ile Lys Tyr Glu Lys Pro Gly Ser Pro Pro Arg Glu Val

1905 1910 1915 1920

Val Pro Arg Pro Arg Pro Gly Val Thr Glu Ala Thr Ile Thr Gly Leu

1925 1930 1935

Glu Pro Gly Thr Glu Tyr Thr Ile Tyr Val Ile Ala Leu Lys Asn Asn

1940 1945 1950

Gln Lys Ser Glu Pro Leu Ile Gly Arg Lys Lys Thr Asp Glu Leu Pro

1955 1960 1965

Gln Leu Val Thr Leu Pro His Pro Asn Leu His Gly Pro Glu Ile Leu

1970 1975 1980

Asp Val Pro Ser Thr Val Gln Lys Thr Pro Phe Val Thr His Pro Gly

1985 1990 1995 2000
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Tyr Asp Thr Gly Asn Gly lie Gin Leu Pro Gly Thr Ser Gly Gin Gin 

2005 2010 2015 

Pro Ser Val Gly Gln Gln Met Ile Phe Glu Glu His Gly Phe Arg Arg

2020 2025 2030

Thr Thr Pro Pro Thr Thr Ala Thr Pro Ile Arg His Arg Pro Arg Pro

2035 2040 2045

Tyr Pro Pro Asn Val Gly Gln Glu Ala Leu Ser Gln Thr Thr Ile Ser

2050 2055 2060

Trp Ala Pro Phe Gln Asp Thr Ser Glu Tyr Ile Ile Ser Cys His Pro

2065 2070 2075 2080

Val Gly Thr Asp Glu Glu Pro Leu Gln Phe Arg Val Pro Gly Thr Ser

2085 2090 2095

Thr Ser Ala Thr Leu Thr Gly Leu Thr Arg Gly Ala Thr Tyr Asn Ile

2100 2105 2110

Ile Val Glu Ala Leu Lys Asp Gln Gln Arg His Lys Val Arg Glu Glu

2115 2120 2125

Val Val Thr Val Gly Asn Ser Val Asn Glu Gly Leu Asn Gln Pro Thr

2130 2135 2140

Asp Asp Ser Cys Phe Asp Pro Tyr Thr Val Ser His Tyr Ala Val Gly

2145 2150 2155 2160

Asp Glu Trp Glu Arg Met Ser Glu Ser Gly Phe Lys Leu Leu Cys Gln

2165 2170 2175

Cys Leu Gly Phe Gly Ser Gly His Phe Arg Cys Asp Ser Ser Arg Trp

2180 2185 2190

Cys His Asp Asn Gly Val Asn Tyr Lys Ile Gly Glu Lys Trp Asp Arg

2195 2200 2205

Gln Gly Glu Asn Gly Gln Met Met Ser Cys Thr Cys Leu Gly Asn Gly

2210 2215 2220

Lys Gly Glu Phe Lys Cys Asp Pro His Glu Ala Thr Cys Tyr Asp Asp

2225 2230 2235 2240

Gly Lys Thr Tyr His Val Gly Glu Gln Trp Gln Lys Glu Tyr Leu Gly

2245 2250 2255

Ala Ile Cys Ser Cys Thr Cys Phe Gly Gly Gln Arg Gly Trp Arg Cys

2260 2265 2270

Asp Asn Cys Arg Arg Pro Gly Gly Glu Pro Ser Pro Glu Gly Thr Thr

2275 2280 2285

Gly Gln Ser Tyr Asn Gln Tyr Ser Gln Arg Tyr His Gln Arg Thr Asn

2290 2295 2300

Thr Asn Val Asn Cys Pro Ile Glu Cys Phe Met Pro Leu Asp Val Gln

2305 2310 2315 2320

Ala Asp Arg Glu Asp Ser Arg Glu

2325

<210> 678 
<211> 188 
<212> PRT 
<213> Homo sapiens 

<400> 678 

His Gln Thr His Lys Glu Gly Gly Ser Thr His Ala Ser Ala Asp Ala

1 5 10 15

Trp Glu Ile Ile Glu Leu Glu Thr Glu Ile Glu Lys Phe Lys Ala Glu

20 25 30

Asn Ala Ser Leu Ala Lys Leu Arg Ile Glu Arg Glu Ser Ala Leu Glu
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675 



<210> 681 
<211> 296 
<212> PRT 
<213> Homo sapiens 

<400> 681 
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Ser 


Thr 


Gly Ser 


Glu Phe Pro Leu 


x 








Ala 


Ala 


Arg Al a 


Glv Glv Ara Ala 






20 




Ara 


Glu 


Thr Arg 


Gly Asn Pro Gly 






35 


40 


Thr 


Met 


Thr Gly 


Ser Asn Met Ser 








55 


Gin 






Ala Ara Phe Ser 










Asn 


Glv 


Gl n TiPH 


Tvr Hi s Glu Hi s 








0 J 




Pro 


Phe Pro 


Glu Glv Leu Phe 






1 on 




Cvs 


Glu 


His Asp 


Phe Gin Met Leu 






115 


120 


Glv 


Glu 


Phe lie 


lie Gly Arg Val 




130 




135 


His 


Pro 


Gly Cys 


Phe Arg Cys Glu 


1 /c 

X*± «J 






1 ^0 


Leu 


m \r 
uiy 


it lie vai 


T .t/c; 7\ en Si a Gl \7 










7\ on 
rioii 


Arg 




Al a Ti\/c Gl v T»^n 

i-lXCL JJjr O OX jr X1CU 






1 a n 




XI X o 




Val lie 


Asn Gl ii Gin Pro 






X -7 _J 


200 


His 


Pro 


Asn His 


Phe Asn Cys Thr 




pin 

£t X W 




215 


Glu 


Ala 


Ara Glu 


Leu Lys Gly Glu 








230 


Lys 


Met 


Gly Val 


Pro lie Cvs Glv 












v ax 




A 7 3 TiPU Glv Lvs 






0 ^ rv 




Cys 


Ala 


Lys Cys 


Glu Lys Pro Phe 






275 


280 


Lys 


Gly 


Leu Ala 


Tyr Cys Glu Leu 




290 




2 95 



<210> 682 
<211> 500 
<212> PRT 
<213> Homo sapiens 

<400> 682 

Met Gly Ile Gly Leu Ser Ala Gln

1 5 
Gly Trp Asp Lys His Ser Tyr Gly 
20 

Phe Cys Ser Ser Gly Thr Gly Gin 

35 40 
Gly Asp Val Ile Gly Cys Cys Val

50 55 
Tyr Thr Lys Asn Gly His Ser Leu 



V-»jr 9 J. liX 


Lys Ala 


Ser 


Pro 


Cvs 


Ser 


1 0 
XU 








X -J 




Leu Gl v 


Trp Arg 


Leu 


Gin 


Gin 


Gin 


25 






30 






Asn Pro 


Gly Leu Gly 


Val 


Ala 


Ala 






45 








Asp Al a 


Leu Ala 


Asn 


Ala 


Val 


Cvs 




60 










Pro Ala 


Glu Arg 


He 


Val 


Asn 


Ser 




75 








80 


Cys Phe 


Val Cys 


Ala 


Gin 


Cys 


Phe 


QO 








95 




Tyr Glu 


Phe Glu Gly 


Ara 


Lys 


Tvr 

xy x 


105 






110 






Phe Ala 


irxO V_yS 


K^y 0 


Glv 

JL 


Ser 


Cvs 

jr >*-> 














He Lys 


ZVl a Mo-f- 


Sen 


Asn 


Asn 


Trp 

XT 














Leu Cys 


Asp Val 


GlU 


Leu 


Ala 


Asp 




155 








160 


Arg Hi s 


Leu Cys 


Arg 


Pro 


Cys 


His 


x / U 








X / 3 




Gl v Ti'VQ 
ox y xjy 0 


Tyr He 


Cys 


Gin 


Arg 


Cys 


1 ft 
10 j 






1 QO 
x^ \J 






T.eu Met 


Phe Arg 


Ser 


Asp 


Ala 


Tvr 






205 








His Cys 


Gly Lys 


Glu 


Leu 


Thr 


Ala 




220 










TiPn Tvr 
xjcu ly x 


Cys Leu 


Pro 


Cys 


His 


Asp 




235 








240 


Ala Cys 


Arg Arg 


Pro 


He 


Glu 


Glv 


O U 








TEC 
ADD 




Gin Trp 


His Val 


Glu 


His 


Phe 


Val 


265 






270 






Leu Gly 


His Arg 


His 


Tyr 


Glu 


Lys 






285 









Gly Val 


Asn Met Asn 


Arg 


Leu 


Pro 


10 






15 




Tyr His 


Gly Asp Asp 


Gly 


His 


Ser 


25 




30 






Pro Tyr 


Gly Pro Thr 


Phe 


Thr 


Thr 




45 








Asn Leu 


He Asn Asn 


Thr 


Cys 


Phe 




60 








Gly He 


Ala Phe Thr 


Asp 


Leu 


Pro 
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60 








H A 

/u 






"D ~y~/-\ A en 
riu ASH 


Leu 


xyr 


±r 1 


x ii J. 


Val 
v ai 


uiy 








O C 
O O 








Hop Hid. 


TV on 


XrXXc 


uiy 


VJj.ll 


hi o 


t 1U 






T A A 
100 










jyiec Arg 


ulU 


Trp 


Arg 


inr 


Lys 


Tl o 

lie 




115 










1 O A 

120 


lie oiy 


Asp 


Arg 


tjlU 


uiy 


ulU 


Trp 


13 0 










135 




oei oer 


Tyr 


Leu 


vai 


111 5 


XI J. 5 


uxy 


T /I C 

145 








150 






Ala Arg 


Ser 


inr 


Asp 


Cain 


inr 


vai 








lob 








ASIl Arg 


vjiix 


Arg 


lie 


oin 


Lys 


Leu 






t o a 
lo 0 










Aid 116 


VjIU 


±nr 


inr 


rf] t-, 

bin 


UJ-il 


Leu 




195 










O A A 

z 00 


Pro Asn 


Leu 


Leu 


pne 


Thr 


Leu 


Lys 


210 










215 




Asn Gly 


Thr 


Asp 


Ser 


Glu 


vai 


Arg 


225 








230 






Ser Gin 


Asp 


Ser 


Tyr 


Pro 


vai 


Ser 








245 








wet ber 


Pro 


Ser 


TT-S 

xilS 


pi , r 

uiy 


Met 


Asn 






260 










Gly Ser 


rpVi -v- 

inr 


Aia 


HIS 


pne 


Ser 


vaiy 




275 










O O A 

280 


lie Ser 


Asn 


Lys 


Ala 


His 


pi v, 

Gin 


Ser 


290 










295 




Ser Ser 


Asn 


lieu 


Asn 


vai 


Pro 


ValU 


305 








310 






Ser Gin 


vain 


vai 


Asn 


Asn 


rfle 


mr 








325 








Asp His 


Tyr 


Ser 


Asn 


Gly 


vai 


vjiy 






340 










Asn Gly 


Ser 


Ser 


iiys 


HIS 


Asp 


TT 4 « 

ill S 




•jrr 

ooo 










*5 £ C\ 


lYieu uiu 


Va 1 

vai 


Asp 


Ser 


Ser 


ni n 


Leu 


T rift 

3 70 










J /o 




Cain Aia 


Aia 


lie 


VjIU 


Arg 


Mot- 


lie 


385 








390 






Met Ser 


Glu 


Gin 


Leu 


Arg 


Arg 


Asp 








405 








Lys Met 


T ^ i -. 

Leu 


Lys 


ASp 


Ala 


7*5 Vi a 


Ser 






420 










Asn Ser 


Pro 


vai 


pi 

Gly 


Asn 


bin 


Leu 




/I "3 C 












Cys Ser 


Hla 


Leu 


Asn 


Ser 


ai a 

Hid 


J. JLC 


450 










4oo 




nil n dta 

ulll XrivJ 




JJC u. 


Al a 


ucu 


Ala 

XT- J- CI 


i"jc; l. 


465 








470 






Leu Met 


Ala 


Arg 


Ser 


Gly 


lie 


Gly 








485 








Asp Tyr 


Leu 


His 















n c 

/ D 




An 
0 u 


Leu 


O J. 11 


Thr* 

X 11JL 


pro nl v nin Val 

VJJ-jr VJIU V CL u. 


Val 




on 




QR 




X 11C 


VclJ. 


XtXXC 


A qtn Tlf* f3l 11 2\QTi 
nop lie uiu noyt 


xyi. 


Iuj 






liu 




o±ll 


Hid 


m n 

OXil 


lie i-io LJ ±\J- y lr 11C 


r 1 u 








TOR 
1Z3 




VjJ-Xl 


Tnr 




lie vjj-ix Xjyo ncL 


"\7a 1 

V dl 








1 A A 
14 U 




iyr 




AT a 

nld 


Trir* Ala f^lii Ala 


XT lie 






IjO 




1 fin 
1 0 u 
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T ,qi n 7\ 1 o Q q v- Tl a 
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T."\/ , e 
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1 / U 




1 7R 
J. / O 




Va 1 

V Ctl 


T.ian 


ai a 


vjiy riiy ncL vjiy 


m ii 

U 1 LI 


lUJ 






15U 




iyj- 


IT J- \J 
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Ash 








9 n r 




Vet JL 


Arg 


OXXX 


dVip Tl p din Mph 
It XlC lie Ulu 1 Y JCL 


Val 

Val 








O *5 A 




Cys 


Leu 




LjI y iixvy dci ir i vj 


xjy 0 






Z J D 




O A A 


Pro 


Arg 


Pro 


irixe jcI OCl XrXTLJ 


Ser 




OCA 




O R R 
ZOO 






Ul e 

m s 




Ti^n Ala Spt* Hi v 

lie Li. nld OCl vJljr 


Lys 


DO 






9 7 A 




irne 


vjrlU 
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Lys oci i-isn vjiy 


Val 








0 ft R 
ZOO 




Tyr 


Cys 


XI J. s 


Cor A en T .t/"ct TTt q 
OCl no 11 uy b ill 0 


U1X1 








^ A A 




Leu 


Asn 


Ser 


lie Asn i v iec oer 


Ar^ 






j Id 




ion 
oz U 


Ser 


2i en 
Afall 


7\ cy-v 
rlop 


Val J\qn Mfaf- nin 

Veil rlop 1*1 C L. Ul Ul 


Thr" 
1111 




"3 1 A 




OOO 






-L 111. 


OCl 


Q#=*T- Aon nil V Pin P 
oci nan uiy riic 


Leu 


*a yl C 






oca 
J OU 




Ul LL 


rlc L. 


m n 

Va J. LL 


Acini (~*\rG Sen T'Vl T* 
iikDlJ V-.yiD Hop 1XX1 


m 11 

U 1 LL 








7 £ R 
«j O O 








ril -n 


UC LL L J D VJljr 


Ser 








"3 ft A 




m o 


jrllC 


m v 

uiy 


Avrr fil n TiAii f^l Tl 
niy uiu. uc (l uin 


Ala 










4 n n 




uiy 


Lys 


Sen T'Vl v* Ala Ben 
flSIl XXXl ±±±Cl rioXX 


uy a 




/I 1 A 

410 




Ai R 




Leu 


Leu 


Aia 


lyxv ocxv Asp xr j. u 


irp 


A O C 






A 1 A 




Asp 


Pro 


J. J-C 


/"Hi n A rn f"2l n DyA 
ol 11 i-iiy ulU. xrivj 


Val 












Leu 


m ii 

ulu 


1 11 IT 


tj -1 e A en T.nn Drn 
X11S AfaXl lie LL XrlVJ 


uy 0 








A C A 




Gly 


Gin 


Ala 


Thr Gin Cys Leu 


Gly 






475 




480 


Ser 


Cys 


Ala 


Phe Ala Thr Val 


Glu 




490 




495 
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500 

<210> 683 
<211> 387 
<212> PRT 

<213> Homo sapiens 
<400> 683 

Met Ala Thr Ser Gly Val Leu Pro Gly Gly Gly Phe Val Ala Ser Ala

1 5 10 15

Ala Ala Val Ala Gly Pro Glu Met Gln Thr Gly Arg Asn Asn Phe Val

20 25 30

Ile Arg Arg Asn Pro Ala Asp Pro Gln Arg Ile Pro Ser Asn Pro Ser

35 40 45

His Arg Ile Gln Cys Ala Ala Gly Tyr Glu Gln Ser Glu His Asn Val

50 55 60

Cys Gln Asp Ile Asp Glu Cys Thr Ala Gly Thr His Asn Cys Arg Ala

65 70 75 80

Asp Gln Val Cys Ile Asn Leu Arg Gly Ser Phe Ala Cys Gln Cys Pro

85 90 95

Pro Gly Tyr Gln Lys Arg Gly Glu Gln Cys Val Asp Ile Asp Glu Cys

100 105 110

Thr Ile Pro Pro Tyr Cys His Gln Arg Cys Val Asn Thr Pro Gly Ser

115 120 125

Phe Tyr Cys Gln Cys Ser Pro Gly Phe Gln Leu Ala Ala Asn Asn Tyr

130 135 140

Thr Cys Val Asp Ile Asn Glu Cys Asp Ala Ser Asn Gln Cys Ala Gln

145 150 155 160

Gln Cys Tyr Asn Ile Leu Gly Ser Phe Ile Cys Gln Cys Asn Gln Gly

165 170 175

Tyr Glu Leu Ser Ser Asp Arg Leu Asn Cys Glu Asp Ile Asp Glu Cys

180 185 190

Arg Thr Ser Ser Tyr Leu Cys Gln Tyr Gln Cys Val Asn Glu Pro Gly

195 200 205

Lys Phe Ser Cys Met Cys Pro Gln Gly Tyr Gln Val Val Arg Ser Arg

210 215 220

Thr Cys Gln Asp Ile Asn Glu Cys Glu Thr Thr Asn Glu Cys Arg Glu

225 230 235 240

Asp Glu Met Cys Trp Asn Tyr His Gly Gly Phe Arg Cys Tyr Pro Arg

245 250 255

Asn Pro Cys Gln Asp Pro Tyr Ile Leu Thr Pro Glu Asn Arg Cys Val

260 265 270

Cys Pro Val Ser Asn Ala Met Cys Arg Glu Leu Pro Gln Ser Ile Val

275 280 285

Tyr Lys Tyr Met Ser Ile Arg Ser Asp Arg Ser Val Pro Ser Asp Ile

290 295 300

Phe Gln Ile Gln Ala Thr Thr Ile Tyr Ala Asn Thr Ile Asn Thr Phe

305 310 315 320

Arg Ile Lys Ser Gly Asn Glu Asn Gly Glu Phe Tyr Leu Arg Gln Thr

325 330 335

Ser Pro Val Ser Ala Met Leu Val Leu Val Lys Ser Leu Ser Gly Pro

340 345 350

Arg Glu His Ile Val Asp Leu Glu Met Leu Thr Val Ser Ser Ile Gly

355 360 365
Thr Phe Arg Thr Ser Ser 

370 
Phe Ser Phe 
385 

<210> 684 
<211> 531 
<212> PRT 
<213> Homo sapiens 
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He 
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Val 
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Gly 


Glu 
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Gly 
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Glu 
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Val 
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Gly 


He 


Met 


Val 


Ala 
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290 
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Val 
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Leu 


Ala 


Gin 


305 
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Gly 


Lys 


Pro 


Val 


He 


Cys 
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Thr 
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He 
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uc u. 
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Tl 
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D D 
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juy S3 


nsll 


Val 

v ax 


•Hiy 








on 






xy x 


Leu 


Tvr 
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V dl 
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1 / U 
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Val 
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nl n 








1 DC 
lO J 








7\ c n 


m v 
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Gl \r 
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Al a 


V CL _L 
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UC- UL 
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Ala 


Z 13 












Phe 


VJ-L y 


Val 
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Gin 


Asp 










Z J> D 




uy s 


Al a 


OCX. 




Val 
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z 0 u 






Xjy 0 


a en 


T "I e 
x x c 


XJjr O 


Tie 


He 






zoo 










Phe 
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Glu 


lie 
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z 0 u 










oiy 


7\ cit*v 


JJC LL 


uiy 


Tie 
xxc 


Glu 


z 












Lys 


Met 


Met 


lie 


Glv 


Aro 










315 




Ala 


Thr 


Gin 


Met 


Leu 


Glu 








330 






Ala 


Glu 


Gly 


Ser 


Asp 


Val 
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He 


Val 


Gly 


Pro 


He 


Gin 


Thr 


Gin 






1 R 

X 3 




His 


Met 


Cys 


Arg 




30 






Thr 


Glv 


He 


He 


*± 0 








T»en 

JJC LX 


xiy 0 


Glu 


Met 




nia 


Gl v 
ox y 


1111 








r n 
□ \j 


Thr 


Ala 


Thr 


Glu 






95 




Ala 


Val 


Ala 


Leu 




110 






Lys 


Glv 

vjj.y 


Ser 


Glv 


lot; 
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Lys 


He 


Thr 


Leu 


Leu 


xxp 


Leu 


Asp 








1 fiO 

X O \J 


uy to 


He 


Tvr 
J-y x 


Val 






I/O 




Gly 


Ala 


Asp 


Phe 




190 






Ser 


Lys 


Lys 


Glv 


? n^ 








Val 


Ser 


Glu 


Lys 


Val 


Asp 


Met 


Val 








£t *± U 


Glu 


Val 




Lys 










Ser 


Lys 


lie 


Glu 




270 






Glu 


Ala 


Ser 


Asp 


? a ^ 
a 0 0 








He 


Pro 


Ala 


Glu 


L.y 0 


A en 

ratal! 


Arg 


Al a 








320 


Ser 


Met 


He 


Lys 






335 




Ala 


Asn 


Ala 


Val 
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7 a n 






JJV 


Leu 


Asp 


o±y 


A "1 -a 
Hid 


Asp Cys lie Met 


Leu Ser Gly 


u±U 1J.1I r\J- a. iiya vj-L. y 






ICC 
J J J 




360 






Asp 


l T\rv 


riU 


UCU 


Glu Ala Val Arg 


Met Gin His 


Tion Tie 1 Ala Z\rn f4l n 




J / U 






375 






Ala 


Glu 


Ala 


Ala 


lie Tyr His Leu Gin Leu Phe 


Glu Glu Leu Ara Ara 


o o c 

J O ZD 








^ -/ \j 


395 


400 


Leu 


Ala 


Pro 


lie 


Thr Ser Asp Pro 


Thr Glu Ala 


Thr Ala Val Gly Ala 










405 


410 


415 


\7a 1 


KjA. U. 




UCI 


Phe Lys Cys Cys 


Ser Gly Ala 


Tip Tie Val Leu Thr 

lie J — lc v a. j. j-iv? u. x ill. 








a o n 




425 


t j u 




Cor 
del 


rj"| v 
uiy 


Arg 


Ser Ala His Gin 


Val Ala Arg 


Tvr" Atct Pro Arcr Ala 






/IOC 




440 






Pro 


Tl p> 
J.JLC 




7\ 1 -a 
i-i_L ct 


Val Thr Arg Asn 


Pro Gin Thr 


Ala Atct Hi "n Ala K"i 




/ten 






455 




A£ n 


JJC Li 


npvy 
lyxr 




ft! V 
\3JLy 


lie Phe Pro Val 


Leu Cys Lys 


Asn Pro Val Gin Glu 


A £ 








470 


475 


O \J 


Ala 


lip 


rAJ. a. 


n.1 ii 

UX Li 


Asp Val Asp Leu Arg Val Asn 


Phe Ala Met Asn Val 

X llw *X^> U X 2w O XX V Wi 










485 


490 


495 




T 

Lys 


7V 1 a 
i-i-J-d 


7\ t»/-t 

arg 


Gly Phe Phe Lys 


Lys Gly Asp 


Val Val Tie Val Leu 








bOU 




505 




Thr 


Gly 


Trp 


Arg 


Pro Gly Ser Gly Phe Thr Asn 


Thr Met Arg Val Val 






515 




520 




525 


Pro 


Val 


Pro 












530 













<210> 685 
<211> 480 
<212> PRT 
<213> Homo sapiens 

<400> 685 



Met Ala Ala Arg Cys Ser Thr Arg Trp Leu Leu Val Val Val Gly Thr
1 5 10 15

Pro Arg Leu Pro Ala Ile Ser Gly Arg Gly Ala Arg Pro Pro Arg Glu
20 25 30

Gly Val Val Gly Ala Trp Leu Ser Arg Lys Leu Ser Val Pro Ala Phe
35 40 45

Ala Ser Ser Leu Thr Ser Cys Gly Pro Arg Ala Leu Leu Thr Leu Arg
50 55 60

Pro Gly Val Ser Leu Thr Gly Thr Lys His Asn Pro Phe Ile Cys Thr
65 70 75 80

Ala Ser Phe His Thr Ser Ala Pro Leu Ala Lys Glu Asp Tyr Tyr Gln
85 90 95

Ile Leu Gly Val Pro Arg Asn Ala Ser Gln Lys Glu Ile Lys Lys Ala
100 105 110

Tyr Tyr Gln Leu Ala Lys Lys Tyr His Pro Asp Thr Asn Lys Asp Asp
115 120 125

Pro Lys Ala Lys Glu Lys Phe Ser Gln Leu Ala Glu Ala Tyr Glu Val
130 135 140

Leu Ser Asp Glu Val Lys Arg Lys Gln Tyr Asp Ala Tyr Gly Ser Ala
145 150 155 160

Gly Phe Asp Pro Gly Ala Ser Gly Ser Gln His Ser Tyr Trp Lys Gly
165 170 175
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Gly Piro 


Thr Val Aso 


Pro 


Glu 
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860 










Arg Asn Gly Glu Ser Met Leu Asn Ala Gly Leu Ile Thr Ala Ser Ser
865 870 875 880

Leu Gln Glu Ala Glu Gln Leu Gln Arg Glu His Glu Gln Phe Gln His
885 890 895

Ala Ile Glu Lys Thr His Gln Ser Ala Leu Gln Val Gln Gln Lys Ala
900 905 910

Glu Ala Met Leu Gln Ala Asn His Tyr Asp Met Asp Met Ile Arg Asp
915 920 925

Cys Ala Glu Lys Val Ala Ser His Trp Gln Gln Leu Met Leu Lys Met
930 935 940

Glu Asp Arg Leu Lys Leu Val Asn Ala Ser Val Ala Phe Tyr Lys Thr
945 950 955 960

Ser Glu Gln Val Cys Ser Val Leu Glu Ser Leu Glu Gln Glu Tyr Lys
965 970 975

Arg Glu Glu Asp Trp Cys Gly Gly Ala Asp Lys Leu Gly Pro Asn Ser
980 985 990

Glu Thr Asp His Val Thr Pro Met Ile Ser Lys His Leu Glu Gln Lys
995 1000 1005

Glu Ala Phe Leu Lys Ala Cys Thr Leu Ala Arg Arg Asn Ala Asp Val
1010 1015 1020

Phe Leu Lys Tyr Leu His Arg Asn Ser Val Asn Met Pro Gly Met Val
1025 1030 1035 1040

Thr His Ile Lys Ala Pro Glu Gln Gln Val Lys Asn Ile Leu Asn Glu
1045 1050 1055

Leu Phe Gln Arg Glu Asn Arg Val Leu His Tyr Trp Thr Met Arg Lys
1060 1065 1070

Arg Arg Leu Asp Gln Cys Gln Gln Tyr Val Val Phe Glu Arg Ser Ala
1075 1080 1085

Lys Gln Ala Leu Glu Trp Ile His Asp Asn Gly Glu Phe Tyr Leu Ser
1090 1095 1100

Thr His Thr Ser Thr Gly Ser Ser Ile Gln His Thr Gln Glu Leu Leu
1105 1110 1115 1120

Lys Glu His Glu Glu Phe Gln Ile Thr Ala Lys Gln Thr Lys Glu Arg
1125 1130 1135

Val Lys Leu Leu Ile Gln Leu Ala Asp Gly Phe Cys Glu Lys Gly His
1140 1145 1150

Ala His Ala Ala Glu Ile Lys Lys Cys Val Thr Ala Val Asp Lys Arg
1155 1160 1165

Tyr Arg Asp Phe Ser Leu Arg Met Glu Lys Tyr Arg Thr Ser Leu Glu
1170 1175 1180

Lys Ala Leu Gly Ile Ser Ser Asp Ser Asn Lys Ser Ser Lys Ser Leu
1185 1190 1195 1200

Gln Leu Asp Ile Ile Pro Ala Ser Ile Pro Gly Ser Glu Val Lys Leu
1205 1210 1215

Arg Asp Ala Ala His Glu Leu Asn Glu Glu Lys Arg Lys Ser Ala Arg
1220 1225 1230

Arg Lys Glu Phe Ile Met Ala Glu Leu Ile Gln Thr Glu Lys Ala Tyr
1235 1240 1245

Val Arg Asp Leu Arg Glu Cys Met Asp Thr Tyr Leu Trp Glu Met Thr
1250 1255 1260

Ser Gly Val Glu Glu Ile Pro Pro Gly Ile Val Asn Lys Glu Leu Ile
1265 1270 1275 1280

Ile Phe Gly Asn Met Gln Glu Ile Tyr Glu Phe His Asn Asn Ile Phe
1285 1290 1295

Leu Lys Glu Leu Glu Lys Tyr Glu Gln Leu Pro Glu Asp Val Gly His
1300 1305 1310

Cys Phe Val Thr Trp Ala Asp Lys Phe Gln Met Tyr Val Thr Tyr Cys
1315 1320 1325

Lys Asn Lys Pro Asp Ser Thr Gln Leu Ile Leu Glu His Ala Gly Ser
1330 1335 1340

Tyr Phe Asp Glu Ile Gln Gln Arg His Gly Leu Ala Asn Ser Ile Ser
1345 1350 1355 1360

Ser Tyr Leu Ile Lys Pro Val Gln Arg Ile Thr Lys Tyr Gln Leu Leu
1365 1370 1375

Leu Lys Glu Leu Leu Thr Cys Cys Glu Glu Gly Lys Gly Glu Ile Lys
1380 1385 1390

Asp Gly Leu Glu Val Met Leu Ser Val Pro Lys Arg Ala Asn Asp Ala
1395 1400 1405

Met His Leu Ser Met Leu Glu Gly Phe Asp Glu Asn Ile Glu Ser Gln
1410 1415 1420

Gly Glu Leu Ile Leu Gln Glu Ser Phe Gln Val Trp Asp Pro Lys Thr
1425 1430 1435 1440

Leu Ile Arg Lys Gly Arg Glu Arg His Leu Phe Leu Phe Glu Met Ser
1445 1450 1455

Leu Val Phe Ser Lys Glu Val Lys Asp Ser Ser Gly Arg Ser Lys Tyr
1460 1465 1470

Leu Tyr Lys Ser Lys Leu Phe Thr Ser Glu Leu Gly Val Thr Glu His
1475 1480 1485

Val Glu Gly Asp Pro Cys Lys Phe Ala Leu Trp Val Gly Arg Thr Pro
1490 1495 1500

Thr Ser Asp Asn Lys Ile Val Leu Lys Ala Ser Ser Ile Glu Asn Lys
1505 1510 1515 1520

Gln Asp Trp Ile Lys His Ile Arg Glu Val Ile Gln Glu Arg Thr Ile
1525 1530 1535

His Leu Lys Gly Ala Leu Lys Glu Pro Ile His Ile Pro Lys Thr Ala
1540 1545 1550

Pro Ala Thr Arg Gln Lys Gly Arg Arg Asp Gly Glu Asp Leu Asp Ser
1555 1560 1565

Gln Gly Asp Gly Ser Ser Gln Pro Asp Thr Ile Ser Ile Ala Ser Arg
1570 1575 1580

Thr Ser Gln Asn Thr Leu Asp Ser Asp Lys Leu Ser Gly Gly Cys Glu
1585 1590 1595 1600
Leu Thr Val Val Ile His Asp Phe Thr Ala Cys Asn Ser Asn Glu Leu
1605 1610 1615

Thr Ile Arg Arg Gly Gln Thr Val Glu Val Leu Glu Arg Pro His Asp
1620 1625 1630

Lys Pro Asp Trp Cys Leu Val Arg Thr Thr Asp Arg Ser Pro Ala Ala
1635 1640 1645

Glu Gly Leu Val Pro Cys Gly Ser Leu Cys Ile Ala His Ser Arg Ser
1650 1655 1660

Ser Met Glu Met Glu Gly Ile Phe Asn His Lys Asp Ser Leu Ser Val
1665 1670 1675 1680

Ser Ser Asn Asp Ala Ser Pro Pro Ala Ser Val Ala Ser Leu Gln Pro
1685 1690 1695

His Met Ile Gly Ala Gln Ser Ser Pro Gly Pro Lys Arg Pro Gly Asn
1700 1705 1710

Thr Leu Arg Lys Trp Leu Thr Ser Pro Val Arg Arg Leu Ser Ser Gly
1715 1720 1725

Lys Ala Asp Gly His Val Lys Lys Leu Ala His Lys His Lys Lys Ser
1730 1735 1740

Arg Glu Val Arg Lys Ser Ala Asp Ala Gly Ser Gln Lys Asp Ser Asp
1745 1750 1755 1760

Asp Ser Ala Ala Thr Pro Gln Asp Glu Thr Val Glu Glu Arg Gly Arg
1765 1770 1775

Asn Glu Gly Leu Ser Ser Gly Thr Leu Ser Lys Ser Ser Ser Ser Gly
1780 1785 1790

Met Gln Ser Cys Gly Glu Glu Glu Gly Glu Glu Gly Ala Asp Ala Val
1795 1800 1805

Pro Leu Pro Pro Pro Met Ala Ile Gln Gln His Ser Leu Leu Gln Pro
1810 1815 1820

Asp Ser Gln Asp Asp Lys Ala Ser Ser Arg Leu Leu Val Arg Pro Thr
1825 1830 1835 1840

Ser Ser Glu Thr Pro Ser Ala Ala Glu Leu Val Ser Ala Ile Glu Glu
1845 1850 1855

Leu Val Lys Ser Lys Met Ala Leu Glu Asp Arg Pro Ser Ser Leu Leu
1860 1865 1870

Val Asp Gln Gly Asp Ser Ser Ser Pro Ser Phe Asn Pro Ser Asp Asn
1875 1880 1885

Ser Leu Leu Ser Ser Ser Ser Pro Ile Asp Glu Met Glu Glu Arg Lys
1890 1895 1900

Ser Ser Ser Leu Lys Arg Arg His Tyr Val Leu Gln Glu Leu Val Glu
1905 1910 1915 1920

Thr Glu Arg Asp Tyr Val Arg Asp Leu Gly Tyr Val Val Glu Gly Tyr
1925 1930 1935

Met Ala Leu Met Lys Glu Asp Gly Val Pro Asp Asp Met Lys Gly Lys
1940 1945 1950

Asp Lys Ile Val Phe Gly Asn Ile His Gln Ile Tyr Asp Trp His Arg
1955 1960 1965

Asp Phe Phe Leu Gly Glu Leu Glu Lys Cys Leu Glu Asp Pro Glu Lys
1970 1975 1980

Leu Gly Ser Leu Phe Val Lys His Glu Arg Arg Leu His Met Tyr Ile
1985 1990 1995 2000

Ala Tyr Cys Gln Asn Lys Pro Lys Ser Glu His Ile Val Ser Glu Tyr
2005 2010 2015

Ile Asp Thr Phe Phe Glu Asp Leu Lys Gln Arg Leu Gly His Arg Leu
2020 2025 2030
Gln Leu Thr Asp Leu Leu Ile Lys Pro Val Gln Arg Ile Met Lys Tyr
2035 2040 2045

Gln Leu Leu Leu Lys Asp Phe Leu Lys Tyr Ser Lys Lys Ala Ser Leu
2050 2055 2060

Asp Thr Ser Glu Leu Glu Arg Ala Val Glu Val Met Cys Ile Val Pro
2065 2070 2075 2080

Arg Arg Cys Asn Asp Met Met Asn Val Gly Arg Leu Gln Gly Phe Asp
2085 2090 2095

Gly Lys Ile Val Ala Gln Gly Lys Leu Leu Leu Gln Asp Thr Phe Leu
2100 2105 2110

Val Thr Asp Gln Asp Ala Gly Leu Leu Pro Arg Cys Arg Glu Arg Arg
2115 2120 2125

Ile Phe Leu Phe Glu Gln Ile Val Ile Phe Ser Glu Pro Leu Asp Lys
2130 2135 2140

Lys Lys Gly Phe Ser Met Pro Gly Phe Leu Phe Lys Asn Ser Ile Lys
2145 2150 2155 2160

Val Ser Cys Leu Cys Leu Glu Glu Asn Val Glu Asn Asp Pro Cys Lys
2165 2170 2175

Phe Ala Leu Thr Ser Arg Thr Gly Asp Val Val Glu Thr Phe Ile Leu
2180 2185 2190

His Ser Ser Ser Pro Ser Val Arg Gln Thr Trp Ile His Glu Ile Asn
2195 2200 2205

Gln Ile Leu Glu Asn Gln Arg Asn Phe Leu Asn Ala Leu Thr Ser Pro
2210 2215 2220

Ile Glu Tyr Gln Arg Asn His Ser Gly Gly Gly Gly Gly Gly Gly Ser
2225 2230 2235 2240

Gly Ala Ala Ala Gly Val Gly Ala Ala Ala Ala Ala Gly Pro Pro Val
2245 2250 2255

Ala Ala Ala Ala Thr Val Ala Ala Pro Ala Ala Ala Ala Ala Pro Pro
2260 2265 2270

Ala Arg Ala Gly Ala Gly Pro Pro Gly Ser Pro Ser Leu Ser Asp Thr
2275 2280 2285

Thr Pro Pro Cys Trp Ser Pro Leu Gln Pro Arg Ala Arg Gln Arg Gln
2290 2295 2300

Thr Arg Cys Gln Ser Glu Ser Ser Ser Ser Ser Asn Ile Ser Thr Met
2305 2310 2315 2320

Leu Val Thr His Asp Tyr Thr Ala Val Lys Glu Asp Glu Ile Asn Val
2325 2330 2335

Tyr Gln Gly Glu Val Val Gln Ile Leu Ala Ser Asn Gln Gln Asn Met
2340 2345 2350

Phe Leu Val Phe Arg Ala Ala Thr Asp Gln Cys Pro Ala Ala Glu Gly
2355 2360 2365

Trp Ile Pro Gly Phe Val Leu Gly His Thr Ser Ala Val Ile Val Glu
2370 2375 2380

Asn Pro Asp Gly Thr Leu Lys Lys Ser Thr Ser Trp His Thr Ala Leu
2385 2390 2395 2400

Arg Leu Arg Lys Lys Ser Glu Lys Lys Asp Lys Asp Gly Lys Arg Glu
2405 2410 2415

Gly Lys Leu Glu Asn Gly Tyr Arg Lys Ser Arg Glu Gly Leu Ser Asn
2420 2425 2430

Lys Val Ser Val Lys Leu Leu Asn Pro Asn Tyr Ile Tyr Asp Val Pro
2435 2440 2445

Pro Glu Phe Val Ile Pro Leu Ser Glu Val Thr Cys Glu Thr Gly Glu
2450 2455 2460
Thr Val Val Leu Arg Cys Arg Val Cys Gly Arg Pro Lys Ala Ser Ile
2465 2470 2475 2480

Thr Trp Lys Gly Pro Glu His Asn Thr Leu Asn Asn Asp Gly His Tyr
2485 2490 2495

Ser Ile Ser Tyr Ser Asp Leu Gly Glu Ala Thr Leu Lys Ile Val Gly
2500 2505 2510

Val Thr Thr Glu Asp Asp Gly Ile Tyr Thr Cys Ile Ala Val Asn Asp
2515 2520 2525

Met Gly Ser Ala Ser Ser Ser Ala Ser Leu Arg Val Leu Gly Pro Gly
2530 2535 2540

Met Asp Gly Ile Met Val Thr Trp Lys Asp Asn Phe Asp Ser Phe Tyr
2545 2550 2555 2560

Ser Glu Val Ala Glu Leu Gly Arg Gly Arg Phe Ser Val Val Lys Lys
2565 2570 2575

Cys Asp Gln Lys Gly Thr Lys Arg Ala Val Ala Thr Lys Phe Val Asn
2580 2585 2590

Lys Lys Leu Met Lys Arg Asp Gln Val Thr His Glu Leu Gly Ile Leu
2595 2600 2605

Gln Ser Leu Gln His Pro Leu Leu Val Gly Leu Leu Asp Thr Phe Glu
2610 2615 2620

Thr Pro Thr Ser Tyr Ile Leu Val Leu Glu Met Ala Asp Gln Gly Arg
2625 2630 2635 2640

Leu Leu Asp Cys Val Val Arg Trp Gly Ser Leu Thr Glu Gly Lys Ile
2645 2650 2655

Arg Ala His Leu Gly Glu Val Leu Glu Ala Val Arg Tyr Leu His Asn
2660 2665 2670

Cys Arg Ile Ala His Leu Asp Leu Lys Pro Glu Asn Ile Leu Val Asp
2675 2680 2685

Glu Ser Leu Ala Lys Pro Thr Ile Lys Leu Ala Asp Phe Gly Asp Ala
2690 2695 2700

Val Gln Leu Asn Thr Thr Tyr Tyr Ile His Gln Leu Leu Gly Asn Pro
2705 2710 2715 2720

Glu Phe Ala Ala Pro Glu Ile Ile Leu Gly Asn Pro Val Ser Leu Thr
2725 2730 2735

Ser Asp Thr Trp Ser Val Gly Val Leu Thr Tyr Val Leu Leu Ser Gly
2740 2745 2750

Val Ser Pro Phe Leu Asp Asp Ser Val Glu Glu Thr Cys Leu Asn Ile
2755 2760 2765

Cys Arg Leu Asp Phe Ser Phe Pro Asp Asp Tyr Phe Lys Gly Val Ser
2770 2775 2780

Gln Lys Ala Lys Glu Phe Val Cys Phe Leu Leu Gln Glu Asp Pro Ala
2785 2790 2795 2800

Lys Arg Pro Ser Ala Ala Leu Ala Leu Gln Glu Gln Trp Leu Gln Ala
2805 2810 2815

Gly Asn Gly Arg Ser Thr Gly Val Leu Asp Thr Ser Arg Leu Thr Ser
2820 2825 2830

Phe Ile Glu Arg Arg Lys His Gln Asn Asp Val Arg Pro Ile Arg Ser
2835 2840 2845

Ile Lys Asn Phe Leu Gln Ser Arg Leu Leu Pro Arg Val
2850 2855 2860
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He 
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Glu 


Phe Arg Val 


1 








5 
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Leu Tyr Ser 








20 






25 


Glu 


Thr 


Gly 


Gly 


Gly Glu 


Gly He Glu 






35 








40 


Glu Lys 


Asp 


Gly 


Glu Lys 


Gly Gin Tyr 




50 










55 


Lys 


Ser 


Lys 


Val 


Pro 


Ala 


Phe Val Arg 


65 










70 




Leu 


Val 


Phe 


His 


Glu Lys 


Ala Trp Asn 










85 






lie 


Val 


Thr 


Asn 


Glu Tyr 


Met Lys Asp 








100 






105 


Thr 


Trp 


His 


Lys 


Pro Asp 


Leu Gly Thr 






115 








120 


Asp 


Pro 


Asn 


Thr 


Trp Lys 


Thr Val Glu 




130 










135 


Asp Arg 


Ser 


Gin 


Val 


Glu 


Pro Ala Asp 


145 










150 




Ala 


Leu 


Phe 


Gin 


Ser 


Val 


Lys Thr Lys 










165 






Trp 


Lys 


Lys 

2 


Glu 


Leu 


Ala 


Asn Ser Pro 








180 






185 


Tyr 


Lys 


Leu 


Val 


Thr 


He 


Lys Phe Lys 






195 








200 


Val 


Glu 


Asn 


Phe 


He 


Gin 


Lys Gin Glu 




210 










215 


His 


Arg 


Gin 


Leu 


Phe 


Cys 


Trp He Asp 


225 










230 




Glu Asp 


He 


Arg 


Arg 


Met 


Glu Asp Glu 










245 






Met Arg 


Lys 


Arg 


Gly Ser 


Val Arg Gly 








260 






265 
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10 




15 
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Leu Lys Asn Glu 
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His Lys He Tyr 


His 


Leu 
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Met 


He Ala Pro Glu 
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Ala 
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Thr 


90 
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He 


Glu 




110 






Leu 


Glu Asn Val His 


Gly 


Leu 




125 






He 
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He 


Ala 




140 






Tyr 


Lys Ala Asp Glu 


Asp 


Pro 
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Gly Pro Leu Gly 
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170 
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Cys 


Ala 




190 






Trp 
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Lys 
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Thr 


Met 
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Glu 


Thr 
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Thr 


Ser Ala Ala Asp 


Val 
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11 
<400> 690 

tcacccttgg caatacagat gt 22 

<210> 691 

<211> 21 

<212> DNA 

<213> Homo sapiens 
<400> 691 

cccacccatg tgcccgtgac a 

<210> 692 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 692 

cgtgattgcc gacgatctc 

<210> 693 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 693 

ttccggccgc tagtcttgt 

<210> 694 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 694 

cgcacccgtt cggtctgacg cagtac 

<210> 695 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 695 

ctttccccac gagtatgttg gt 

<210> 696 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 696 

aagggaccgt gctgtgagc 

<210> 697 

<211> 29 

<212> DNA 

<213> Homo sapiens 

<400> 697 

agcccctcat gtgcctctca acttcatag 

<210> 698 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> partial signal sequence of HBM and Zmax1 
<400> 698 

Ser Pro Ala His Ser Ser 
1 5 



<210> 699 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic duplex 
<400> 699 

tcgaccacca tggaggcagc gccgc 

<210> 700 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic duplex 
<400> 700 

ggtggtacct ccgtcgcggc gggcc 

<210> 701 
<211> 55 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic duplex 
<400> 701 

catgtgtgaa atcaccaagc cgccctcaga cgacagcccg gcccacagca gtggc 55

<210> 702
<211> 63
<212> DNA

<213> Artificial Sequence
<220>

<223> synthetic duplex
<400> 702

tcgagtacac actttagtgg ttcggcggga gtctgctgtc gggccgggtg tcgtcaccgc 60
cgg 63

<210> 703
<211> 17
<212> PRT

<213> Artificial Sequence
<220>

<223> peptide from Zmax1 sequence (SEQ ID NO: 3)
<400> 703

Val Glu Thr Pro Arg Ile Glu Arg Ala Gly Met Asp Gly Ser Thr Arg

1 5 10 15

Lys 



<210> 704 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmax1 sequence (SEQ ID NO: 3)
<400> 704

Asn Lys Arg Thr Gly Gly Lys Arg Lys Glu Ile Leu Ser Ala Leu
1 5 10 15



<210> 705 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmax1 sequence (SEQ ID NO: 3)
<400> 705

Glu Arg Gln Pro Phe Phe His Thr Arg Cys Glu Glu
1 5 10



<210> 706 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmax1 sequence (SEQ ID NO: 3)
<400> 706

Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu Asp Lys Leu Pro
1 5 10 15

<210> 707
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmax1 sequence (SEQ ID NO: 3)
<400> 707

Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Gln Cys Gly Gln
1 5 10 15 



<210> 708 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmax1 sequence (SEQ ID NO: 3)
<400> 708

Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn
1 5 10



<210> 709 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmax1 sequence (SEQ ID NO: 3)
<400> 709

Tyr Ala Gly Ala Asn Gly Pro Phe Pro His Glu Tyr Val Ser Gly
1 5 10 15



<210> 710 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmax1 sequence (SEQ ID NO: 3)
<400> 710

Ala Cys Gly Lys Ser Met Met Ser Ser Val Ser Leu Met
1 5 10



<210> 711 
<211> 18 
<212> PRT

<213> Artificial Sequence
<220>

<223> peptide from Zmax1 sequence (SEQ ID NO: 3)
<400> 711

Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser Asp Ser Asp

1 5 10 15

Pro Tyr 



<210> 712 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmax1 sequence (SEQ ID NO: 3)
<400> 712

Ser Gly Trp Asn Glu Cys Ala Ser Ser Asn Gly His Cys Ser His
1 5 10 15



<210> 713 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmax1 sequence (SEQ ID NO: 3)
<400> 713

Asn Gly Asp Ala Asn Cys Gln Asp Lys Ser Asp Glu Lys Asn
1 5 10 



<210> 714 

<211> 22 

<212> PRT 

<213> Homo sapiens 

<400> 714 

Ile Lys Arg Ala Lys Asp Asp Gly Thr Gln Pro Phe Val Leu Thr Ser

1 5 10 15

Leu Ser Gln Gly Gln Asn
20 



<210> 715 
<211> 38 
<212> DNA 
<213> Artificial Sequence 
<220> 

<223> primer 
<400> 715 

cgtacgtaaa gcggccgctt ggcaatacag atgtggga 

<210> 716 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 716 

cgtacgtaaa gcggccgcag tagctcctct cggtggc 

<210> 717 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 717 

cgtacgtaaa gcggccgcgc tcatcatgga ctttccg 

<210> 718 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 718 

cgtacgtaaa gcggccgcgc actgctgttt gatgagg 

<210> 719 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 719 

cgtacgtaaa gcggccgcga gtgtggaaga aaggctgc 

<210> 720 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 720 

cgtacgtaaa gcggccgcag tagagcttcc cctcctgc 38

<210> 721 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 721 

cgtacgtaaa gcggccgcgt ccatcacgaa gtccaggt 38

<210> 722 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 722 

cagcggcctg gaggatgc 18 

<210> 723 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 723 

cggtccagta gaggtttcg 19 

<210> 724 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 724 

catcagccgc gccttcatg 19 
<210> 725 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 725 

cctgcatgtt ggtgaagtac 

<210> 726 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 726 

gcccgaaacc tctactggac cgac 

<210> 727 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 727 

gcccacccca tcacagttca catt 

<210> 728 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 728 

cagcggcctg gaggatgc 

<210> 729 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 729 

cgggtacatg tactggacag ctgattagc 



<210> 730 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 730 

gtcagcctgg aggagttctc a 21 

<210> 731 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 731 

tcacccttgg caatacagat gt 22 

<210> 732 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 732 

cccacccatg tgcccgtgac a 21 

<210> 733 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 733 

cgtgattgcc gacgatctc 19 

<210> 734 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 734 

ttccggccgc tagtcttgt 19 

<210> 735 
<211> 26 
<212> DNA 

<213> Homo sapiens 
<400> 735 

cgcacccgtt cggtctgacg cagtac 26

<210> 736 
<211> 22 
<212> DNA 
<213> Mouse 

<400> 736 
ctttccccac gagtatgttg gt 22 

<210> 737 
<211> 19 
<212> DNA 
<213> Mouse 

<400> 737 

aagggaccgt gctgtgagc 19 

<210> 738 
<211> 29 
<212> DNA 
<213> Mouse 

<400> 738 

agcccctcat gtgcctctca acttcatag 29 

<210> 739 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer
<400> 739

gagcgggcag ggatggatgg 20

<210> 740
<211> 22
<212> DNA

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 740 

aggttggcac ggtggatgaa gc 22 

<210> 741 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 741 

aagcttgttt aaactgggca tggtggcaca tggttgtaat 40 

<210> 742 
<211> 34 
<212> DNA 
<213> Artificial Sequence 
<220> 

<223> primer 
<400> 742 

gggcttccac ccagtcagtc cagtacatgt acct 34 



<210> 743 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 743 

ctgactgggt ggaagcaccc cggatcgagc 30

<210> 744
<211> 33
<212> DNA

<213> Artificial Sequence
<220>

<223> primer
<400> 744

gaattcatcg gtacctgtgc ggccgcttca ttg 33

<210> 745 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 745 

tgagatgtcc tgtctgtggc 20 

<210> 746 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 746 

tccttccttc cctacagttg 20 

<210> 747 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 747 

cctaaggatc tccttgtgtc tgtgg 

<210> 748 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 748 

ctgcagcagg tcagtagcct gc 

<210> 749 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 749 

tgagatgtcc tgtctgtggc 

<210> 750 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 750 

tccttccttc cctacagttg 

<210> 751 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> primer 
<400> 751 

gcttctggcg tgtgaccggc g 

<210> 752
<211> 22
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 752 

gccgcacagc gccagcagca gc 22 

<210> 753 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 753 

cacccacgcc ccacagccag ta 22 

<210> 754 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 754 

atttgccctc ccatatgtcc ttcc 24 

<210> 755 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 755 

ttcctcccag ccctcctcca tcag 24 

<210> 756 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 756 

gccgcacagc gccagcagca gc 22 
<210> 757
<211> 19
<212> DNA

<213> Artificial Sequence



<220> 

<223> primer 



<400> 757 

gaatggcgcc cccgacgac 19 

<210> 758 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 758 

gctcccattc atcagttcca tagg 24 



<210> 759 
<211> 5278 
<212> DNA 
<213> Mouse 



<400> 759 

tctagactcg aggcggccgc ccattgtgca ctaaagcgcc gccgccgcgc catggagccc 60
gagtgagcgc ggcgcgggcc cgtccggccg ccggacaaca tggaggcagc gccgcccggg 120
ccgccgtggc cgctgctgct gctgctgctg ctgctgctgg cgctgtgcgg ctgcccggcc 180
cccgccgcgg cctcgccgct cctgctattt gccaaccgcc gggacgtacg gctggtggac 240
gccggcggag tcaagctgga gtccaccatc gtggtcagcg gcctggagga tgcggccgca 300
gtggacttcc agttttccaa gggagccgtg tactggacag acgtgagcga ggaggccatc 360
aagcagacct acctgaacca gacgggggcc gccgtgcaga acgtggtcat ctccggcctg 420
gtctctcccg acggcctcgc ctgcgactgg gtgggcaaga agctgtactg gacggactca 480
gagaccaacc gcatcgaggt ggccaacctc aatggcacat cccggaaggt gctcttctgg 540
caggaccttg accagccgag ggccatcgcc ttggaccccg ctcacgggta catgtactgg 600
acagactggg ttgagacgcc ccggattgag cgggcaggga tggatggcag cacccggaag 660
atcattgtgg actcggacat ttactggccc aatggactga ccatcgacct ggaggagcag 720
aagctctact gggctgacgc caagctcagc ttcatccacc gtgccaacct ggacggctcg 780
ttccggcaga aggtggtgga gggcagcctg acgcacccct tcgccctgac gctctccggg 840
gacactctgt actggacaga ctggcagacc cgctccatcc atgcctgcaa caagcgcact 900
ggggggaaga ggaaggagat cctgagtgcc ctctactcac ccatggacat ccaggtgctg 960
agccaggagc ggcagccttt cttccacact cgctgtgagg aggacaatgg cggctgctcc 1020
cacctgtgcc tgctgtcccc aagcgagcct ttctacacat gcgcctgccc cacgggtgtg 1080
cagctgcagg acaacggcag gacgtgtaag gcaggagccg aggaggtgct gctgctggcc 1140
cggcggacgg acctacggag gatctcgctg gacacgccgg acttcaccga catcgtgctg 1200
caggtggacg acatccggca cgccattgcc atcgactacg acccgctaga gggctatgtc 1260
tactggacag atgacgaggt gcgggccatc cgcagggcgt acctggacgg gtctggggcg 1320
cagacgctgg tcaacaccga gatcaacgac cccgatggca tcgcggtcga ctgggtggcc 1380
cgaaacctct actggaccga cacgggcacg gaccgcatcg aggtgacgcg cctcaacggc 1440
acctcccgca agatcctggt gtcggaggac ctggacgagc cccgagccat cgcactgcac 1500
cccgtgatgg gcctcatgta ctggacagac tggggagaga accctaaaat cgagtgtgcc 1560
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aacttggatg ggcaggagcg gcgtgtgctg 
ctggccctgg acctgcagga ggggaagctc 
gaggtgatca atgttgatgg gacgaagagg 
attttcgggt tcacgctgct gggggacttc 
atcgagcggg tgcacaaggt caaggccagc 
ctgatggggc tcaaagctgt gaatgtggcc 
aggaacgggg ggtgcagcca cctgtgcttc 
cccatcggcc tggagctgct gagtgacatg 
gtcttcacca gcagagccgc catccacagg 
gccatcccgc tcacgggcgt caaggaggcc 
cacatctact ggacagacgt cagcctgaag 
tcggtggagc acgtggtgga gtttggcctt 
atgggcaaga acctctactg ggccgacact 
gacgggcagt tccggcaagt cctcgtgtgg 
ctggatccca ccaagggcta catctactgg 
cgggccttca tggacgggac caactgcatg 
gacctcacca ttgactacgc tgaccagcgc 
atcgagtcgt ccaacatgct gggtcaggag 
ccgttcggtc tgacgcagta cagcgattat 
attgagcggg ccgacaagac tagcggccgg 
ttcgtgatgg acatcctggt gttccactcc 
cacaacaacg ggcagtgtgg gcagctgtgc 
tgcgcctcac actacaccct ggaccccagc 
ttgctgttca gccagaaatc tgccatcagt 
gatctcatcc tgcccctgca tggactgagg 
gacaagttca tctactgggt ggatgggcgc 
acccagccct ttgttttgac ctctctgagc 
gacctcagca tcgacatcta cagccggaca 
atcaacgtcc acaggctgag cggggaagcc 
aagcccaggg ccatcgtcgt caacgcggag 
gaccgggcag ccaagatcga acgcgcagcc 
accaccggcc tcatccgccc tgtggccctg 
tgggtggacg cggacctgaa gcgcattgag 
accctggagg acgccaacat cgtgcagcct 
tactggatcg accgccagca gcagatgatc 
cggactcgca tccagggccg tgtcgcccac 
agcctggagg agttctcagc ccacccatgt 
tgtattgcca agggtgatgg gacaccacgg 
cagaacctgc tgacctgtgg agagccgccc 
acaggggaga tcgactgtat ccccggggcc 
gaccagagcg acgaggaggg ctgccccgtg 
ggtcagtgtg tggacctgcg cctgcgctgc 
gacgaggcgg actgtgacgc catctgcctg 
tgtgtcctca tcaaacagca gtgcgactcc 
ctcatgtgtg aaatcaccaa gccgccctca 
gggcccgtca ttggcatcat cctctctctc 
cagcgcgtgg tgtgccagcg ctatgcgggg 
agcgggaccc cgcacgtgcc cctcaatttc 
ttcacaggca tcgcatgcgg aaagtccatg 
ggcggggtgc ccctctacga ccggaaccac 
agcacgaagg ccacgctgta cccgccgatc 
ccctccctgt acaacatgga catgttctac 
tacaggccct acatcattcg aggaatggcg 
tgtgacagcg actacagcgc cagccgctgg 



gtcaatgcct ccctcgggtg gcccaacggc 162 0 
tactggggag acgccaagac agacaagatc 1680 
cggaccctcc tggaggacaa gctcccgcac 174 0 
atctactgga ctgactggca gcgccgcagc 1800 
cgggacgtca tcattgacca gctgcccgac 1860 
aaggtcgtcg gaaccaaccc gtgtgcggac 1920 
ttcacacccc acgcaacccg gtgtggctgc 1980 
aagacctgca tcgtgcctga ggccttcttg 2040 
atctccctcg agaccaataa caacgacgtg 2100 
tcagccctgg actttgatgt gtccaacaac 2160 
accatcagcc gcgccttcat gaacgggagc 222 0 
gactaccccg agggcatggc cgttgactgg 228 0 
gggaccaaca gaatcgaagt ggcgcggctg 2340 
agggacttgg acaacccgag gtcgctggcc 2400 
accgagtggg gcggcaagcc gaggatcgtg 24 60 
acgctggtgg acaaggtggg ccgggccaac 252 0 
ctctactgga ccgacctgga caccaacatg 2580 
cgggtcgtga ttgccgacga tctcccgcac 2 640 
atctactgga cagactggaa tctgcacagc 2700 
aaccgcaccc tcatccaggg ccacctggac 2760 
tcccgccagg atggcctcaa tgactgtatg 282 0 
cttgccatcc ccggcggcca ccgctgcggc 2 880 
agccgcaact gcagcccgcc caccaccttc 2940 
cggatgatcc cggacgacca gcacagcccg 3 000 
aacgtcaaag ccatcgacta tgacccactg 3 060 
cagaacatca agcgagccaa ggacgacggg 3120 
caaggccaaa acccagacag gcagccccac 3180 
ctgttctgga cgtgcgaggc caccaatacc 3240 
atgggggtgg tgctgcgtgg ggaccgcgac 3 3 00 
cgagggtacc tgtacttcac caacatgcag 3360 
ctggacggca ccgagcgcga ggtcctcttc 342 0 
gtggtagaca acacactggg caagctgttc 34 80 
agctgtgacc tgtcaggggc caaccgcctg 3540 
ctgggcctga ccatccttgg caagcatctc 3 600 
gagcgtgtgg agaagaccac cggggacaag 3 660 
ctcactggca tccatgcagt ggaggaagtc 372 0 
gcccgtgaca atggtggctg ctcccacatc 3 780 
tgctcatgcc cagtccacct cgtgctcctg 3 840 
acctgctccc cggaccagtt tgcatgtgcc 3 900 
tggcgctgtg acggctttcc cgagtgcgat 3 960 
tgctccgccg cccagttccc ctgcgcgcgg 4020 
gacggcgagg cagactgtca ggaccgctca 4080 
cccaaccagt tccggtgtgc gagcggccag 414 0 
ttccccgact gtatcgacgg ctccgacgag 4200 
gacgacagcc cggcccacag cagtgccatc 4260 
ttcgtcatgg gtggtgtcta ttttgtgtgc 4320 
gccaacgggc ccttcccgca cgagtatgtc 4380 
atagccccgg gcggttccca gcatggcccc 4440 
atgagctccg tgagcctgat ggggggccgg 4500 
gtcacagggg cctcgtccag cagctcgtcc 4560 
ctgaacccgc cgccctcccc ggccacggac 4 620 
tcttcaaaca ttccggccac tgcgagaccg 4680 
cccccgacga cgccctgcag caccgacgtg 4740 
aaggccagca agtactacct ggatttgaac 4800 
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tcggactcag acccctatcc acccccaccc 
gacagctgcc cgccctcgcc cgccaccgag 
ccgtccccct gcacggactc atcctgacct 
ctgtaaatag ttttaaatat gaacaaagaa 
tataattggg attttaaaaa catgagaaat 
agaactttgt acagtggaga aatatttata 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 



acgccccaca gccagtacct gtcggcggag 4860 

aggagctact tccatctctt cccgccccct 492 0 

cggccgggcc actctggctt ctctgtgccc 4980 

aaaaatatat tttatgattt aaaaaataaa 5040 

gtgaactgtg atggggtggg cagggctggg 510 0 

aacttaattt tgtaaaacag aaaaaaaaaa 5160 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 522 0 

aaaaaaaaaa aaaaaaaaaa gcggccgc 527 8 



<210> 760 

<211> 5278 

<212> DNA 

<213> Homo sapiens 

<400> 760 

tctagactcg aggcggccgc ccattgtgca ctaaagcgcc gccgccgcgc catggagccc 60
gagtgagcgc ggcgcgggcc cgtccggccg ccggacaaca tggaggcagc gccgcccggg 120
ccgccgtggc cgctgctgct gctgctgctg ctgctgctgg cgctgtgcgg ctgcccggcc 180
cccgccgcgg cctcgccgct cctgctattt gccaaccgcc gggacgtacg gctggtggac 240
gccggcggag tcaagctgga gtccaccatc gtggtcagcg gcctggagga tgcggccgca 300
gtggacttcc agttttccaa gggagccgtg tactggacag acgtgagcga ggaggccatc 360
aagcagacct acctgaacca gacgggggcc gccgtgcaga acgtggtcat ctccggcctg 420
gtctctcccg acggcctcgc ctgcgactgg gtgggcaaga agctgtactg gacggactca 480
gagaccaacc gcatcgaggt ggccaacctc aatggcacat cccggaaggt gctcttctgg 540
caggaccttg accagccgag ggccatcgcc ttggaccccg ctcacgggta catgtactgg 600
acagactggg gtgagacgcc ccggattgag cgggcaggga tggatggcag cacccggaag 660
atcattgtgg actcggacat ttactggccc aatggactga ccatcgacct ggaggagcag 720
aagctctact gggctgacgc caagctcagc ttcatccacc gtgccaacct ggacggctcg 780
ttccggcaga aggtggtgga gggcagcctg acgcacccct tcgccctgac gctctccggg 840
gacactctgt actggacaga ctggcagacc cgctccatcc atgcctgcaa caagcgcact 900
ggggggaaga ggaaggagat cctgagtgcc ctctactcac ccatggacat ccaggtgctg 960
agccaggagc ggcagccttt cttccacact cgctgtgagg aggacaatgg cggctgctcc 1020
cacctgtgcc tgctgtcccc aagcgagcct ttctacacat gcgcctgccc cacgggtgtg 1080
cagctgcagg acaacggcag gacgtgtaag gcaggagccg aggaggtgct gctgctggcc 1140
cggcggacgg acctacggag gatctcgctg gacacgccgg acttcaccga catcgtgctg 1200
caggtggacg acatccggca cgccattgcc atcgactacg acccgctaga gggctatgtc 1260
tactggacag atgacgaggt gcgggccatc cgcagggcgt acctggacgg gtctggggcg 1320
cagacgctgg tcaacaccga gatcaacgac cccgatggca tcgcggtcga ctgggtggcc 1380
cgaaacctct actggaccga cacgggcacg gaccgcatcg aggtgacgcg cctcaacggc 1440
acctcccgca agatcctggt gtcggaggac ctggacgagc cccgagccat cgcactgcac 1500
cccgtgatgg gcctcatgta ctggacagac tggggagaga accctaaaat cgagtgtgcc 1560
aacttggatg ggcaggagcg gcgtgtgctg gtcaatgcct ccctcgggtg gcccaacggc 1620
ctggccctgg acctgcagga ggggaagctc tactggggag acgccaagac agacaagatc 1680
gaggtgatca atgttgatgg gacgaagagg cggaccctcc tggaggacaa gctcccgcac 1740
attttcgggt tcacgctgct gggggacttc atctactgga ctgactggca gcgccgcagc 1800
atcgagcggg tgcacaaggt caaggccagc cgggacgtca tcattgacca gctgcccgac 1860
ctgatggggc tcaaagctgt gaatgtggcc aaggtcgtcg gaaccaaccc gtgtgcggac 1920
aggaacgggg ggtgcagcca cctgtgcttc ttcacacccc acgcaacccg gtgtggctgc 1980
cccatcggcc tggagctgct gagtgacatg aagacctgca tcgtgcctga ggccttcttg 2040
gtcttcacca gcagagccgc catccacagg atctccctcg agaccaataa caacgacgtg 2100
gccatcccgc tcacgggcgt caaggaggcc tcagccctgg actttgatgt gtccaacaac 2160
cacatctact ggacagacgt cagcctgaag accatcagcc gcgccttcat gaacgggagc 2220
tcggtggagc acgtggtgga gtttggcctt gactaccccg agggcatggc cgttgactgg 2280
atgggcaaga acctctactg ggccgacact gggaccaaca gaatcgaagt ggcgcggctg 2340
gacgggcagt tccggcaagt cctcgtgtgg agggacttgg acaacccgag gtcgctggcc 2400
ctggatccca ccaagggcta catctactgg accgagtggg gcggcaagcc gaggatcgtg 2460
cgggccttca tggacgggac caactgcatg acgctggtgg acaaggtggg ccgggccaac 2520
gacctcacca ttgactacgc tgaccagcgc ctctactgga ccgacctgga caccaacatg 2580
atcgagtcgt ccaacatgct gggtcaggag cgggtcgtga ttgccgacga tctcccgcac 2640
ccgttcggtc tgacgcagta cagcgattat atctactgga cagactggaa tctgcacagc 2700
attgagcggg ccgacaagac tagcggccgg aaccgcaccc tcatccaggg ccacctggac 2760
ttcgtgatgg acatcctggt gttccactcc tcccgccagg atggcctcaa tgactgtatg 2820
cacaacaacg ggcagtgtgg gcagctgtgc cttgccatcc ccggcggcca ccgctgcggc 2880
tgcgcctcac actacaccct ggaccccagc agccgcaact gcagcccgcc caccaccttc 2940
ttgctgttca gccagaaatc tgccatcagt cggatgatcc cggacgacca gcacagcccg 3000
gatctcatcc tgcccctgca tggactgagg aacgtcaaag ccatcgacta tgacccactg 3060
gacaagttca tctactgggt ggatgggcgc cagaacatca agcgagccaa ggacgacggg 3120
acccagccct ttgttttgac ctctctgagc caaggccaaa acccagacag gcagccccac 3180
gacctcagca tcgacatcta cagccggaca ctgttctgga cgtgcgaggc caccaatacc 3240
atcaacgtcc acaggctgag cggggaagcc atgggggtgg tgctgcgtgg ggaccgcgac 3300
aagcccaggg ccatcgtcgt caacgcggag cgagggtacc tgtacttcac caacatgcag 3360
gaccgggcag ccaagatcga acgcgcagcc ctggacggca ccgagcgcga ggtcctcttc 3420
accaccggcc tcatccgccc tgtggccctg gtggtagaca acacactggg caagctgttc 3480
tgggtggacg cggacctgaa gcgcattgag agctgtgacc tgtcaggggc caaccgcctg 3540
accctggagg acgccaacat cgtgcagcct ctgggcctga ccatccttgg caagcatctc 3600
tactggatcg accgccagca gcagatgatc gagcgtgtgg agaagaccac cggggacaag 3660
cggactcgca tccagggccg tgtcgcccac ctcactggca tccatgcagt ggaggaagtc 3720
agcctggagg agttctcagc ccacccatgt gcccgtgaca atggtggctg ctcccacatc 3780
tgtattgcca agggtgatgg gacaccacgg tgctcatgcc cagtccacct cgtgctcctg 3840
cagaacctgc tgacctgtgg agagccgccc acctgctccc cggaccagtt tgcatgtgcc 3900
acaggggaga tcgactgtat ccccggggcc tggcgctgtg acggctttcc cgagtgcgat 3960
gaccagagcg acgaggaggg ctgccccgtg tgctccgccg cccagttccc ctgcgcgcgg 4020
ggtcagtgtg tggacctgcg cctgcgctgc gacggcgagg cagactgtca ggaccgctca 4080
gacgaggcgg actgtgacgc catctgcctg cccaaccagt tccggtgtgc gagcggccag 4140
tgtgtcctca tcaaacagca gtgcgactcc ttccccgact gtatcgacgg ctccgacgag 4200
ctcatgtgtg aaatcaccaa gccgccctca gacgacagcc cggcccacag cagtgccatc 4260
gggcccgtca ttggcatcat cctctctctc ttcgtcatgg gtggtgtcta ttttgtgtgc 4320
cagcgcgtgg tgtgccagcg ctatgcgggg gccaacgggc ccttcccgca cgagtatgtc 4380
agcgggaccc cgcacgtgcc cctcaatttc atagccccgg gcggttccca gcatggcccc 4440
ttcacaggca tcgcatgcgg aaagtccatg atgagctccg tgagcctgat ggggggccgg 4500
ggcggggtgc ccctctacga ccggaaccac gtcacagggg cctcgtccag cagctcgtcc 4560
agcacgaagg ccacgctgta cccgccgatc ctgaacccgc cgccctcccc ggccacggac 4620
ccctccctgt acaacatgga catgttctac tcttcaaaca ttccggccac tgcgagaccg 4680
tacaggccct acatcattcg aggaatggcg cccccgacga cgccctgcag caccgacgtg 4740
tgtgacagcg actacagcgc cagccgctgg aaggccagca agtactacct ggatttgaac 4800
tcggactcag acccctatcc acccccaccc acgccccaca gccagtacct gtcggcggag 4860
gacagctgcc cgccctcgcc cgccaccgag aggagctact tccatctctt cccgccccct 4920
ccgtccccct gcacggactc atcctgacct cggccgggcc actctggctt ctctgtgccc 4980
ctgtaaatag ttttaaatat gaacaaagaa aaaaatatat tttatgattt aaaaaataaa 5040
tataattggg attttaaaaa catgagaaat gtgaactgtg atggggtggg cagggctggg 5100
agaactttgt acagtggaga aatatttata aacttaattt tgtaaaacag aaaaaaaaaa 5160
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 5220
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa gcggccgc 5278

<210> 761 

<211> 1615 

<212> PRT 

<213> Homo sapiens 

<400> 761 

Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu Leu Leu Leu 
1 5 10 15 

Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser 
20 25 30 

Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala 
35 40 45 

Gly Gly Val Lys Leu Glu Ser Thr Ile Val Val Ser Gly Leu Glu Asp 
50 55 60 

Ala Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val Tyr Trp Thr 
65 70 75 80 

Asp Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn Gln Thr Gly 
85 90 95 

Ala Ala Val Gln Asn Val Val Ile Ser Gly Leu Val Ser Pro Asp Gly 
100 105 110 

Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu 
115 120 125 

Thr Asn Arg Ile Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val 
130 135 140 

Leu Phe Trp Gln Asp Leu Asp Gln Pro Arg Ala Ile Ala Leu Asp Pro 
145 150 155 160 

Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Gly Glu Thr Pro Arg Ile 
165 170 175 

Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys Ile Ile Val Asp Ser 
180 185 190 

Asp Ile Tyr Trp Pro Asn Gly Leu Thr Ile Asp Leu Glu Glu Gln Lys 
195 200 205 

Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe Ile His Arg Ala Asn Leu 
210 215 220 

Asp Gly Ser Phe Arg Gln Lys Val Val Glu Gly Ser Leu Thr His Pro 
225 230 235 240 

Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Trp Gln 
245 250 255 

Thr Arg Ser Ile His Ala Cys Asn Lys Arg Thr Gly Gly Lys Arg Lys 
260 265 270 

Glu Ile Leu Ser Ala Leu Tyr Ser Pro Met Asp Ile Gln Val Leu Ser 
275 280 285 

Gln Glu Arg Gln Pro Phe Phe His Thr Arg Cys Glu Glu Asp Asn Gly 
290 295 300 

Gly Cys Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro Phe Tyr Thr 
305 310 315 320 

Cys Ala Cys Pro Thr Gly Val Gln Leu Gln Asp Asn Gly Arg Thr Cys 
325 330 335 

Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg Thr Asp Leu 
340 345 350 

Arg Arg Ile Ser Leu Asp Thr Pro Asp Phe Thr Asp Ile Val Leu Gln 
355 360 365 

Val Asp Asp Ile Arg His Ala Ile Ala Ile Asp Tyr Asp Pro Leu Glu 
370 375 380 

Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala Ile Arg Arg Ala 
385 390 395 400 

Tyr Leu Asp Gly Ser Gly Ala Gln Thr Leu Val Asn Thr Glu Ile Asn 
405 410 415 

Asp Pro Asp Gly Ile Ala Val Asp Trp Val Ala Arg Asn Leu Tyr Trp 
420 425 430 

Thr Asp Thr Gly Thr Asp Arg Ile Glu Val Thr Arg Leu Asn Gly Thr 
435 440 445 

Ser Arg Lys Ile Leu Val Ser Glu Asp Leu Asp Glu Pro Arg Ala Ile 
450 455 460 

Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp Trp Gly Glu 
465 470 475 480 

Asn Pro Lys Ile Glu Cys Ala Asn Leu Asp Gly Gln Glu Arg Arg Val 
485 490 495 

Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala Leu Asp Leu 
500 505 510 

Gln Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp Lys Ile Glu 
515 520 525 

Val Ile Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu Asp Lys 
530 535 540 

Leu Pro His Ile Phe Gly Phe Thr Leu Leu Gly Asp Phe Ile Tyr Trp 
545 550 555 560 

Thr Asp Trp Gln Arg Arg Ser Ile Glu Arg Val His Lys Val Lys Ala 
565 570 575 

Ser Arg Asp Val Ile Ile Asp Gln Leu Pro Asp Leu Met Gly Leu Lys 
580 585 590 

Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys Ala Asp Arg 
595 600 605 

Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His Ala Thr Arg 
610 615 620 

Cys Gly Cys Pro Ile Gly Leu Glu Leu Leu Ser Asp Met Lys Thr Cys 
625 630 635 640 

Ile Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala Ala Ile His 
645 650 655 

Arg Ile Ser Leu Glu Thr Asn Asn Asn Asp Val Ala Ile Pro Leu Thr 
660 665 670 

Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser Asn Asn His 
675 680 685 

Ile Tyr Trp Thr Asp Val Ser Leu Lys Thr Ile Ser Arg Ala Phe Met 
690 695 700 

Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu Asp Tyr Pro 
705 710 715 720 

Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr Trp Ala Asp 
725 730 735 

Thr Gly Thr Asn Arg Ile Glu Val Ala Arg Leu Asp Gly Gln Phe Arg 
740 745 750 

Gln Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser Leu Ala Leu 
755 760 765 

Asp Pro Thr Lys Gly Tyr Ile Tyr Trp Thr Glu Trp Gly Gly Lys Pro 
770 775 780 

Arg Ile Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met Thr Leu Val 
785 790 795 800 

Asp Lys Val Gly Arg Ala Asn Asp Leu Thr Ile Asp Tyr Ala Asp Gln 
805 810 815 

Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met Ile Glu Ser Ser Asn 
820 825 830 

Met Leu Gly Gln Glu Arg Val Val Ile Ala Asp Asp Leu Pro His Pro 
835 840 845 

Phe Gly Leu Thr Gln Tyr Ser Asp Tyr Ile Tyr Trp Thr Asp Trp Asn 
850 855 860 

Leu His Ser Ile Glu Arg Ala Asp Lys Thr Ser Gly Arg Asn Arg Thr 
865 870 875 880 

Leu Ile Gln Gly His Leu Asp Phe Val Met Asp Ile Leu Val Phe His 
885 890 895 

Ser Ser Arg Gln Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Gln 
900 905 910 

Cys Gly Gln Leu Cys Leu Ala Ile Pro Gly Gly His Arg Cys Gly Cys 
915 920 925 

Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys Ser Pro Pro 
930 935 940 

Thr Thr Phe Leu Leu Phe Ser Gln Lys Ser Ala Ile Ser Arg Met Ile 
945 950 955 960 

Pro Asp Asp Gln His Ser Pro Asp Leu Ile Leu Pro Leu His Gly Leu 
965 970 975 

Arg Asn Val Lys Ala Ile Asp Tyr Asp Pro Leu Asp Lys Phe Ile Tyr 
980 985 990 

Trp Val Asp Gly Arg Gln Asn Ile Lys Arg Ala Lys Asp Asp Gly Thr 
995 1000 1005 

Gln Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn Pro Asp Arg 
1010 1015 1020 

Gln Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr Leu Phe Trp 
1025 1030 1035 1040 

Thr Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu Ser Gly Glu 
1045 1050 1055 

Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala Ile 
1060 1065 1070 

Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gln Asp 
1075 1080 1085 

Arg Ala Ala Lys Ile Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu 
1090 1095 1100 

Val Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu Val Val Asp 
1105 1110 1115 1120 

Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg Ile 
1125 1130 1135 

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala 
1140 1145 1150 

Asn Ile Val Gln Pro Leu Gly Leu Thr Ile Leu Gly Lys His Leu Tyr 
1155 1160 1165 

Trp Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu Lys Thr Thr 
1170 1175 1180 

Gly Asp Lys Arg Thr Arg Ile Gln Gly Arg Val Ala His Leu Thr Gly 
1185 1190 1195 1200 

Ile His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro 
1205 1210 1215 

Cys Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile Ala Lys Gly 
1220 1225 1230 

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gln 
1235 1240 1245 

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gln Phe 
1250 1255 1260 

Ala Cys Ala Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala Trp Arg Cys 
1265 1270 1275 1280 

Asp Gly Phe Pro Glu Cys Asp Asp Gln Ser Asp Glu Glu Gly Cys Pro 
1285 1290 1295 

Val Cys Ser Ala Ala Gln Phe Pro Cys Ala Arg Gly Gln Cys Val Asp 
1300 1305 1310 

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp Arg Ser Asp 
1315 1320 1325 

Glu Ala Asp Cys Asp Ala Ile Cys Leu Pro Asn Gln Phe Arg Cys Ala 
1330 1335 1340 

Ser Gly Gln Cys Val Leu Ile Lys Gln Gln Cys Asp Ser Phe Pro Asp 
1345 1350 1355 1360 

Cys Ile Asp Gly Ser Asp Glu Leu Met Cys Glu Ile Thr Lys Pro Pro 
1365 1370 1375 

Ser Asp Asp Ser Pro Ala His Ser Ser Ala Ile Gly Pro Val Ile Gly 
1380 1385 1390 

Ile Ile Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gln 
1395 1400 1405 

Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His 
1410 1415 1420 

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe Ile Ala Pro 
1425 1430 1435 1440 

Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys Gly Lys Ser 
1445 1450 1455 

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu 
1460 1465 1470 

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser 
1475 1480 1485 

Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro Pro Ser Pro 
1490 1495 1500 

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn 
1505 1510 1515 1520 

Ile Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr Ile Ile Arg Gly Met 
1525 1530 1535 

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr 
1540 1545 1550 

Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser 
1555 1560 1565 

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gln Tyr Leu 
1570 1575 1580 

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr 
1585 1590 1595 1600 

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 
1605 1610 1615 



<210> 762 
<211> 1564 
<212> PRT 
<213> Mouse 

<400> 762 

Met Glu Thr Ala Pro Thr Arg Ala Pro Pro Pro Pro Pro Pro Pro Leu 
1 5 10 15 

Leu Leu Leu Val Leu Tyr Cys Ser Leu Val Pro Ala Ala Ala Ser Pro 
20 25 30 

Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala Gly 
35 40 45 

Gly Val Lys Leu Glu Ser Thr Ile Val Ala Ser Gly Leu Glu Asp Ala 
50 55 60 

Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val Tyr Trp Thr Asp 
65 70 75 80 

Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn Gln Thr Gly Ala 
85 90 95 

Ala Ala Gln Asn Ile Val Ile Ser Gly Leu Val Ser Pro Asp Gly Leu 
100 105 110 

Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu Thr 
115 120 125 

Asn Arg Ile Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val Leu 
130 135 140 

Phe Trp Gln Asp Leu Asp Gln Pro Arg Ala Ile Ala Leu Asp Pro Ala 
145 150 155 160 

His Gly Tyr Met Tyr Trp Thr Asp Trp Gly Glu Ala Pro Arg Ile Glu 
165 170 175 

Arg Ala Gly Met Asp Gly Ser Thr Arg Lys Ile Ile Val Asp Ser Asp 
180 185 190 

Ile Tyr Trp Pro Asn Gly Leu Thr Ile Asp Leu Glu Glu Gln Lys Leu 
195 200 205 

Tyr Trp Ala Asp Ala Lys Leu Ser Phe Ile His Arg Ala Asn Leu Asp 
210 215 220 

Gly Ser Phe Arg Gln Lys Val Val Glu Gly Ser Leu Thr His Pro Phe 
225 230 235 240 

Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Trp Gln Thr 
245 250 255 

Arg Ser Ile His Ala Cys Asn Lys Trp Thr Gly Glu Gln Arg Lys Glu 
260 265 270 

Ile Leu Ser Ala Leu Tyr Ser Pro Met Asp Ile Gln Val Leu Ser Gln 
275 280 285 

Glu Arg Gln Pro Pro Phe His Thr Pro Cys Glu Glu Asp Asn Gly Gly 
290 295 300 

Cys Ser His Leu Cys Leu Leu Ser Pro Arg Glu Pro Phe Tyr Ser Cys 
305 310 315 320 

Ala Cys Pro Thr Gly Val Gln Leu Gln Asp Asn Gly Lys Thr Cys Lys 
325 330 335 

Thr Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg Thr Asp Leu Arg 
340 345 350 

Arg Ile Ser Leu Asp Thr Pro Asp Phe Thr Asp Ile Val Leu Gln Val 
355 360 365 

Gly Asp Ile Arg His Ala Ile Ala Ile Asp Tyr Asp Pro Leu Glu Gly 
370 375 380 

Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala Ile Arg Arg Ala Tyr 
385 390 395 400 

Leu Asp Gly Ser Gly Ala Gln Thr Leu Val Asn Thr Glu Ile Asn Asp 
405 410 415 

Pro Asp Gly Ile Ala Val Asp Trp Val Ala Arg Asn Leu Tyr Trp Thr 
420 425 430 

Asp Thr Gly Thr Asp Arg Ile Glu Val Thr Arg Leu Asn Gly Thr Ser 
435 440 445 

Arg Lys Ile Leu Val Ser Glu Asp Leu Asp Glu Pro Arg Ala Ile Val 
450 455 460 

Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp Trp Gly Glu Asn 
465 470 475 480 

Pro Lys lie Glu Cys Ala Asn Leu Asp Gly Arg Asp Arg His Val Leu 

485 490 495 

Val Asn Thr Ser Leu Gly Trp Pro Asn Gly Leu Ala Leu Asp Leu Gin 

500 505 510 

Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp Lys He Glu Val 

515 520 525 

He Asn He Asp Gly Thr Lys Arg Lys Thr Leu Leu Glu Asp Lys Leu 

530 535 540 

Pro His He Phe Gly Phe Thr Leu Leu Gly Asp Phe He Tyr Trp Thr 
545 550 555 560 

Asp Trp Gin Arg Arg Ser He Glu Arg Val His Lys Val Lys Ala Ser 

565 570 575 

Arg Asp Val He He Asp Gin Leu Pro Asp Leu Met Gly Leu Lys Ala 

580 585 590 

Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys Ala Asp Gly Asn 

595 600 605 

Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro Arg Ala Thr Lys Cys 

610 615 620 

Gly Cys Pro He Gly Leu Glu Leu Leu Ser Asp Met Lys Thr Cys He 
625 630 635 640 

He Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala Thr He His Arg 

645 650 655 

He Ser Leu Glu Thr Asn Asn Asn Asp Val Ala He Pro Leu Thr Gly 

660 665 670 

Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser Asn Asn His He 

675 680 685 

Tyr Trp Thr Asp Val Ser Leu Lys Thr lie Ser Arg Ala Phe Met Asn 

690 695 700 

Gly Ser Ser Val Glu His Val He Glu Phe Gly Leu Asp Tyr Pro Glu 
705 710 715 720 

Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr Trp Ala Asp Thr 

725 730 735 

Gly Thr Asn Arg He Glu Val Ala Arg Leu Asp Gly Gin Phe Arg Gin 

740 745 750 

Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser Leu Ala Leu Asp 

755 760 765 

Pro Thr Lys Gly Tyr He Tyr Trp Thr Glu Trp Gly Gly Lys Pro Arg 

770 775 780 

He Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met Thr Leu Val Asp 
785 790 795 800 

Lys Val Gly Arg Ala Asn Asp Leu Thr He Asp Tyr Ala Asp Gin Arg 

805 810 815 

Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met He Glu Ser Ser Asn Met 

820 825 830 

Leu Gly Gin Glu Arg Met Val lie Ala Asp Asp Leu Pro Tyr Pro Phe 

835 840 845 

Gly Leu Thr Gin Tyr Ser Asp Tyr He Tyr Trp Thr Asp Trp Asn Leu 

850 855 860 

His Ser He Glu Arg Ala Asp Lys Thr Ser Gly Arg Asn Arg Thr Leu 
865 870 875 880 

lie Gin Gly His Leu Asp Phe Val Met Asp He Leu Val Phe His Ser 

885 890 895 

Ser Arg Gin Asp Gly Leu Asn Asp Cys Val His Ser Asn Gly Gin Cys 
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900 905 910 

Gly Gln Leu Cys Leu Ala Ile Pro Gly Gly His Arg Cys Gly Cys Ala
915 920 925

Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys Ser Pro Pro Ser
930 935 940

Thr Phe Leu Leu Phe Ser Gln Lys Phe Ala Ile Ser Arg Met Ile Pro
945 950 955 960

Asp Asp Gln Leu Ser Pro Asp Leu Val Leu Pro Leu His Gly Leu Arg
965 970 975

Asn Val Lys Ala Ile Asn Tyr Asp Pro Leu Asp Lys Phe Ile Tyr Trp
980 985 990

Val Asp Gly Arg Gln Asn Ile Lys Arg Ala Lys Asp Asp Gly Thr Gln
995 1000 1005

Pro Ser Met Leu Thr Ser Pro Ser Gln Ser Leu Ser Pro Asp Arg Gln
1010 1015 1020

Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr Leu Phe Trp Thr
1025 1030 1035 1040

Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu Asp Gly Asp Ala
1045 1050 1055

Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala Ile Ala
1060 1065 1070

Val Asn Ala Glu Arg Gly Tyr Met Tyr Phe Thr Asn Met Gln Asp His
1075 1080 1085

Ala Ala Lys Ile Glu Arg Ala Ser Leu Asp Gly Thr Glu Arg Glu Val
1090 1095 1100

Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu Val Val Asp Asn
1105 1110 1115 1120

Ala Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg Ile Glu
1125 1130 1135

Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala Asn
1140 1145 1150

Ile Val Gln Pro Val Gly Leu Thr Val Leu Gly Arg His Leu Tyr Trp
1155 1160 1165

Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu Lys Thr Thr Gly
1170 1175 1180

Asp Lys Arg Thr Arg Val Gln Gly Arg Val Thr His Leu Thr Gly Ile
1185 1190 1195 1200

His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro Cys
1205 1210 1215

Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile Ala Lys Gly Asp
1220 1225 1230

Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gln Asn
1235 1240 1245

Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gln Phe Ala
1250 1255 1260

Cys Thr Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala Trp Arg Cys Asp
1265 1270 1275 1280

Gly Phe Pro Glu Cys Ala Asp Gln Ser Asp Glu Glu Gly Cys Pro Val
1285 1290 1295

Cys Ser Ala Ser Gln Phe Pro Cys Ala Arg Gly Gln Cys Val Asp Leu
1300 1305 1310

Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp Arg Ser Asp Glu
1315 1320 1325

Ala Asn Cys Asp Ala Val Cys Leu Pro Asn Gln Phe Arg Cys Thr Ser
1330 1335 1340

Gly Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gln Arg Val Met
1345 1350 1355 1360

Cys Gln Arg Tyr Thr Gly Ala Ser Gly Pro Phe Pro His Glu Tyr Val
1365 1370 1375

Gly Gly Ala Pro His Val Pro Leu Asn Phe Ile Ala Pro Gly Gly Ser
1380 1385 1390

Gln His Gly Pro Phe Pro Gly Ile Pro Cys Ser Lys Ser Val Met Ser
1395 1400 1405

Ser Met Ser Leu Val Gly Gly Arg Gly Ser Val Pro Leu Tyr Asp Arg
1410 1415 1420

Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser Thr Lys Ala
1425 1430 1435 1440

Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro Pro Ser Pro Ala Thr Asp
1445 1450 1455

Pro Ser Leu Tyr Asn Val Asp Val Phe Tyr Ser Ser Gly Ile Pro Ala
1460 1465 1470

Thr Ala Arg Pro Tyr Arg Pro Tyr Val Ile Arg Gly Met Ala Pro Pro
1475 1480 1485

Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr Ser Ile Ser
1490 1495 1500

Arg Trp Lys Ser Ser Lys Tyr Tyr Leu Asp Leu Asn Ser Asp Ser Asp
1505 1510 1515 1520

Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gln Tyr Leu Ser Ala Glu
1525 1530 1535

Asp Ser Cys Pro Pro Ser Pro Gly Thr Glu Arg Ser Tyr Cys His Leu
1540 1545 1550

Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser
1555 1560



<210> 763 

<211> 1665 

<212> PRT 

<213> Homo sapiens 



<400> 763 



Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu Leu Leu Leu
1 5 10 15

Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser
20 25 30

Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala
35 40 45

Gly Gly Val Lys Leu Glu Ser Thr Ile Val Val Ser Gly Leu Glu Asp
50 55 60

Ala Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val Tyr Trp Thr
65 70 75 80

Asp Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn Gln Thr Gly
85 90 95

Ala Ala Val Gln Asn Val Val Ile Ser Gly Leu Val Ser Pro Asp Gly
100 105 110

Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu
115 120 125

Thr Asn Arg Ile Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val
130 135 140



Leu Phe Trp Gln Asp Leu Asp Gln Pro Arg Ala Ile Ala Leu Asp Pro
145 150 155 160

Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Gly Glu Thr Pro Arg Ile
165 170 175

Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys Ile Ile Val Asp Ser
180 185 190






A«3T*> 


lie 


iy i 


Trp 


Pro 


Asn 


Glv 


Leu 


Thr 


He 


Asp 


Leu 


Glu 


Glu 


Gin 


Lys 






195 










200 










205 








i/CU 


Tvr 


lip 


Ala 


Asp 


Ala 


Lys 


Leu 


Ser 


Phe 


He 


His 


Arcr 


Ala 


Asn 


Leu 




210 










215 










220 














Ser 


IT lie: 




Gin 


Lys 


Val 


Val 


Glu 


Glv 


Ser 


Leu 


Thr 


His 


Pro 


225 










230 










235 










240 


Phe 


Ala 


Leu 


Thr 


Leu 


Ser 


Glv 


Asp 


Thr 


Leu 


Tvr 


Trp 


Thr 


Asp 

* XT 


Trp 

XT 


Gin 










245 










250 










255 




Thr 


Ara 

«iy 


Ser 


He 


His 


Ala 


Cvs 


Asn 


Lvs 


Ara 


Thr 


Gly 


Gly 


Lys 


Arg 


Lys 








260 










265 










270 






Glu 


lie 


Leu 


Ser 


Ala 


Leu 


Tvr 


Ser 


Pro 


Met 


Asp 


He 


Gin 


Val 


Leu 


Ser 






275 










280 










285 








Olll 


din 

ul LL 


Axy 


VJlil 


pv-n 


Phe 


Phe 


His 


Thr 


Arcr 


Cys 


Glu 


Glu 


Asp 


Asn 


Gly 














295 










300 










C*~\ \T 


Cys 


OC1 


111 b 


Ti^i 1 
uc u 




ucu 


Leu 


Ser 


Pro 


Ser 


Glu 


Pro 


Phe 


Tvr 

a 


Thr 


_> \J -) 








310 










315 










320 


/""•we 

L.ys 


ai s 

rrJL a. 




tz 1 


i in 


Gly 


Val 


Gin 


Leu 


Gin 


Asp 


Asn 


Glv 


Ara 


Thr 


CVS 










325 










330 










335 




Ta/c 

u_y & 


Aid. 


Gly 


Ala 


Glu 


Glu 


Val 


Leu 


Leu 


Leu 


Ala 


Arg 


Arcr 


Thr 


Asp 


Leu 








340 










345 










350 










lie 


Ser 


Leu 


Asp 


Thr 


Pro 


Asp 


Phe 


Thr 


Asp 


He 


Val 


Leu 


Gin 






355 










360 










365 








VQl 






Tie 




His 


Ala 


He 


Ala 


He 


Aso 


Tvr 


Asp 


Pro 


Leu 


Glu 




3 70 










375 










380 










pi v 


Tyr 


V ctl 


iyr 


irp 


i in 


nop 




Glu 


Val 


Arg 


Ala 


He 


Arcr 


Arg 


Ala 


JOJ 










390 










395 










400 


Tyr 


lieu 


7\ en 
nop 


uiy 


O CI 


\j ~l y 


Ala 


Gin 


Thr 


Leu 


Val 


Asn 


Thr 


Glu 


He 


Asn 










405 










410 










415 




Asp 


Pro 


Asp 


Glv 


He 


Ala 


Val 


Asp 


Trp 


Val 


Ala 


Arcr 


Asn 


Leu 


Tyr 


Trp 








42 0 










425 










430 






Thr 


Asp 


Thr 


Glv 


Thr 


Asp 


Arq 


He 


Glu 


Val 


Thr 


Arg 


Leu 


Asn 


Gly 


Thr 






435 










440 










445 








Ser 


Arg 


Lys 


He 


Leu 


Val 


Ser 


Glu 


Asp 


Leu 


Asp 


Glu 


Pro 


Arg 


Ala 


He 




450 










455 










460 










Al a 


JUCU 


His 


Pro 


Val 


Met 


Glv 


Leu 


Met 


Tvr 


Trp 


Thr 


Asp 


Trp 


Gly 


Glu 


465 










470 










475 










480 


Asn 


rJ. <J 




He 

lie 


Glu 


V— jr ID 


Ala 


Asn 


Leu 


ASD 


Glv 


Gin 


Glu 


Arg 


Arcr 


Val 










485 










490 










495 




UC LL 


Val 

V Ctl 


Asn 


Ala 


Ser 


Leu 


Glv 


Tm 


Pro 


Asn 


Glv 


Leu 


Ala 


Leu 


Asp 


Leu 








500 










505 










510 






will 


Glu 


Glv 


Lys 


Leu 


Tvr 


Tt~D 
up 


Glv 


Asp 


Ala 


LVS 


Thr 


Asp 


Lvs 


He 


Glu 






515 










520 










525 








Val Ile Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu Asp Lys
530 535 540

Leu Pro His Ile Phe Gly Phe Thr Leu Leu Gly Asp Phe Ile Tyr Trp
545 550 555 560

Thr Asp Trp Gln Arg Arg Ser Ile Glu Arg Val His Lys Val Lys Ala
565 570 575

Ser Arg Asp Val Ile Ile Asp Gln Leu Pro Asp Leu Met Gly Leu Lys
580 585 590

Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys Ala Asp Arg
595 600 605

Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His Ala Thr Arg
610 615 620

Cys Gly Cys Pro Ile Gly Leu Glu Leu Leu Ser Asp Met Lys Thr Cys
625 630 635 640

Ile Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala Ala Ile His
645 650 655

Arg Ile Ser Leu Glu Thr Asn Asn Asn Asp Val Ala Ile Pro Leu Thr
660 665 670

Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser Asn Asn His
675 680 685

Ile Tyr Trp Thr Asp Val Ser Leu Lys Thr Ile Ser Arg Ala Phe Met
690 695 700

Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu Asp Tyr Pro
705 710 715 720

Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr Trp Ala Asp
725 730 735

Thr Gly Thr Asn Arg Ile Glu Val Ala Arg Leu Asp Gly Gln Phe Arg
740 745 750

Gln Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser Leu Ala Leu
755 760 765

Asp Pro Thr Lys Gly Tyr Ile Tyr Trp Thr Glu Trp Gly Gly Lys Pro
770 775 780

Arg Ile Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met Thr Leu Val
785 790 795 800

Asp Lys Val Gly Arg Ala Asn Asp Leu Thr Ile Asp Tyr Ala Asp Gln
805 810 815

Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met Ile Glu Ser Ser Asn
820 825 830

Met Leu Gly Gln Glu Arg Val Val Ile Ala Asp Asp Leu Pro His Pro
835 840 845

Phe Gly Leu Thr Gln Tyr Ser Asp Tyr Ile Tyr Trp Thr Asp Trp Asn
850 855 860

Leu His Ser Ile Glu Arg Ala Asp Lys Thr Ser Gly Arg Asn Arg Thr
865 870 875 880

Leu Ile Gln Gly His Leu Asp Phe Val Met Asp Ile Leu Val Phe His
885 890 895

Ser Ser Arg Gln Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Gln
900 905 910

Cys Gly Gln Leu Cys Leu Ala Ile Pro Gly Gly His Arg Cys Gly Cys
915 920 925

Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys Ser Pro Pro
930 935 940

Thr Thr Phe Leu Leu Phe Ser Gln Lys Ser Ala Ile Ser Arg Met Ile
945 950 955 960

Pro Asp Asp Gln His Ser Pro Asp Leu Ile Leu Pro Leu His Gly Leu
965 970 975

Arg Asn Val Lys Ala Ile Asp Tyr Asp Pro Leu Asp Lys Phe Ile Tyr
980 985 990

Trp Val Asp Gly Arg Gln Asn Phe Ser Gln Lys Ser Ala Ile Asn Arg
995 1000 1005

Met Val Ile Asp Glu Gln Gln Ser Pro Asp Ile Ile Leu Pro Ile His
1010 1015 1020

Ser Leu Arg Asn Val Arg Ala Ile Asp Tyr Asp Pro Leu Asp Lys Gln
1025 1030 1035 1040

Leu Tyr Trp Ile Asp Ser Arg Gln Asn Ile Lys Arg Ala Lys Asp Asp
1045 1050 1055

Gly Thr Gln Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn Pro
1060 1065 1070

Asp Arg Gln Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr Leu
1075 1080 1085

Phe Trp Thr Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu Ser
1090 1095 1100

Gly Glu Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg
1105 1110 1115 1120

Ala Ile Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met
1125 1130 1135

Gln Asp Arg Ala Ala Lys Ile Glu Arg Ala Ala Leu Asp Gly Thr Glu
1140 1145 1150

Arg Glu Val Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu Val
1155 1160 1165

Val Asp Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys
1170 1175 1180

Arg Ile Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu
1185 1190 1195 1200

Asp Ala Asn Ile Val Gln Pro Leu Gly Leu Thr Ile Leu Gly Lys His
1205 1210 1215

Leu Tyr Trp Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu Lys
1220 1225 1230

Thr Thr Gly Asp Lys Arg Thr Arg Ile Gln Gly Arg Val Ala His Leu
1235 1240 1245

Thr Gly Ile His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala
1250 1255 1260

His Pro Cys Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile Ala
1265 1270 1275 1280

Lys Gly Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu
1285 1290 1295

Leu Gln Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp
1300 1305 1310

Gln Phe Ala Cys Ala Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala Trp
1315 1320 1325

Arg Cys Asp Gly Phe Pro Glu Cys Asp Asp Gln Ser Asp Glu Glu Gly
1330 1335 1340

Cys Pro Val Cys Ser Ala Ala Gln Phe Pro Cys Ala Arg Gly Gln Cys
1345 1350 1355 1360

Val Asp Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp Arg
1365 1370 1375

Ser Asp Glu Ala Asp Cys Asp Ala Ile Cys Leu Pro Asn Gln Phe Arg
1380 1385 1390

Cys Ala Ser Gly Gln Cys Val Leu Ile Lys Gln Gln Cys Asp Ser Phe
1395 1400 1405

Pro Asp Cys Ile Asp Gly Ser Asp Glu Leu Met Cys Glu Ile Thr Lys
1410 1415 1420

Pro Pro Ser Asp Asp Ser Pro Ala His Ser Ser Ala Ile Gly Pro Val
1425 1430 1435 1440

Ile Gly Ile Ile Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val
1445 1450 1455

Cys Gln Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro Phe
1460 1465 1470

Pro His Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe Ile
1475 1480 1485

Ala Pro Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys Gly
1490 1495 1500

Lys Ser Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val
1505 1510 1515 1520

Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser
1525 1530 1535

Ser Ser Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro Pro
1540 1545 1550

Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser
1555 1560 1565

Ser Asn Ile Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr Ile Ile Arg
1570 1575 1580

Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser
1585 1590 1595 1600

Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu
1605 1610 1615

Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gln
1620 1625 1630

Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg
1635 1640 1645

Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser
1650 1655 1660

Ser
1665



<210> 764 
<211> 1613 
<212> PRT 

<213> Homo sapiens 



<400> 764 



Met Gly Ala Val Leu Arg Ser Leu Leu Ala Cys Ser Phe Cys Val Leu
1 5 10 15

Leu Arg Ala Ala Pro Leu Leu Leu Tyr Ala Asn Arg Arg Asp Leu Arg
20 25 30

Leu Val Asp Ala Thr Asn Gly Lys Glu Asn Ala Thr Ile Val Val Gly
35 40 45

Gly Leu Glu Asp Ala Ala Ala Val Asp Phe Val Phe Ser His Gly Leu
50 55 60

Ile Tyr Trp Ser Asp Val Ser Glu Glu Ala Ile Lys Arg Thr Glu Phe
65 70 75 80

Asn Lys Thr Glu Ser Val Gln Asn Val Val Val Ser Gly Leu Leu Ser
85 90 95

Pro Asp Gly Leu Ala Cys Asp Trp Leu Gly Glu Lys Leu Tyr Trp Thr
100 105 110






Asp Ser Glu Thr Asn Arg Ile Glu Val Ser Asn Leu Asp Gly Ser Leu
115

liJ 




Arg 


Lys 


Val 


Leu 




130 






Leu 


Asp 


Pro 


OCX 


-J AC 










T,vC 


Tie 

X X C 


Glu 


Tl e 

IXC 




Ser 


Glu 








180 


ol U 


m n 

Olll 


uy o 


Leu 










Ser 


Asn 


Leu 


Asp 




210 






irX U 


nib 


xrJL \J 


.true 


o o c 
22b 








Asp 


Trp 


DC! 


Thr 


c?i v 

oiy 


Leu 




Glu 








960 

O \J 


Ala 


Phe 


Ser 


Gin 










Asp 


Asn 


Gly 


Gly 




290 






Irxlc 


Tyr 


fJl Tl 
UJ.11 


v-y to 


3 0b 








Lys 


xnr 


Cys 


Lys 


Thr 


Asp 


DC LI 


-arg 








O ^fc U 


Val 

vox 


Leu 


Gin 


Leu 










Pro 


Val 


Glu 


Gly 




370 






Arg 




OCX 


Phe* 


T o c 

3 8b 








o-J.Il 


IXC 


Ala 


TT-| e 


Leu 


Tyr 


Trp 


Thr 
lllx 








a o n 
4 ^ u 


.ritpil 


ftl V 


Thr 


Met 










Arg 


Ala 


He 


Val 




450 






Tr*T> 


Gly 


Glu 


He 


4ob 








Arg 


vdl 


v ax 


JJC LI 


T.an 


ir 


i y j - 


Asp 








Ou VJ 




Tip 


Gl 17 

ox u. 


Val 






515 




Glu 


Asp 


Lys 


He 




530 






Val 


Tyr 


Trp 


Thr 



12 0 



Phe 


up 


r»l n 

UlU 


Glu 






135 




Ser 


Gly 


Phe 


Met 




150 






Arg 


Ala 


Glv 

oxy 


Met 


165 








He 


Tvr 

xy x 


Tiro 

X X±J 


Pro 


Tvr 

xy x 


xx y 


Ala 


Asp 








200 


VJljf 


Thr* 

111X 




Aiy 






215 




Ala 


Leu 


Thr 


Leu 




Z> J V 






nx o 


Car 


Tl *=> 

1 1C 


Leu 


245 








He 


His 


Ser 


Asp 


Gin 




Gin 


Pro 








280 


Cys 


Ser 


His 


Leu 






295 




Ala 


Cys 


Pro 


Thr 




j j- \j 






Ato LJ 


biy 


Al a 
Aid 


Th t* 

X 11X 


325 








Arg 


Tl f=» 


OCX 




Glu 


Asp 


He 










360 


iyx 


Tl P 


Tvr 
iyi 


xxp 






375 




He 


Asp 


Gly 


Ser 




390 






Pro 


Asp 


oiy 


Tl 


*± v o 








Asp 


11117 


uiy 


Thr 

X1X1 


7\ -r~Cl 


xiy o 


Tl ^ 
lie 


Leu 








440 


Leu 


Zi e-r\ 

AO 


Pro 


Met 






455 




Pro 


Lys 


He 


Glu 




470 






Val 


7A an 
rioll 


X ill 


Ser 


4 85 








Glu 


oxy 


xiy & 


He 


Met 


Asn 


Thr 


Asp 








520 


Pro 


His 


He 


Phe 






535 




Asp 


Trp 


Gin 


Arg 



ucu 


rib jJ 


Rl n 

OX IX 


Prpj 
Xr x o 








14 0 


Tvr 
xyi 


Tm 
xip 


Thr 


Asp 






155 




Asp 


Gly 


Ser 


Ser 




170 






Asn 


Glv 


Leu 


Thr 


185 








Ala 


Lys 


Leu 


Asn 


ttl n 

Olil 


Al a 


Val 

vdl 


Val 

v ax 








990 


XT 11C 


ox u. 




Tl P 

X X c 






235 




Ala 


Cys 


Asn 


Lys 




250 






He 


Phe 


Ser 


Pro 


265 








Asn 


Ala 


Thr 


Asn 


* — _y o 


Leu 


Met 


Ser 








300 


oiy 


Val 
val 


Xjy to 


ucu. 






315 




Glu 


Leu 


Leu 


Leu 




330 






7\ en 


Thr 

nix 


Drn 

XT X <J 


X*Xto LJ 


345 








xix a 


Ala 


He 

X X c 


Ala 


Thr 


Asp 


Asp 


Glu 








380 


oxy 


Qpv* 
OCX 


f?l n 

OX XX 


Phe* 

JrllC 






395 




Ala 


Val 


Asp 


Trp 




410 






Asp 


21 rrr 

Arg 


Tl 
lie 


OX IX 


425 








He 


Ser 


Glu 


Asp 


Val 


Glv 

oxy 


Tvr 

j.y x 


Met 








460 


7\ T"fT 

ALy 


Ala 


Ala 


Leu 






475 




Leu 


Gly 


Trp 


Pro 




490 






iyr 


xrp 


oxy 


A an 


505 








Gly 


Thr 


Gly 


Arg 


Gly 


Phe 


Thr 


Leu 








540 


Arg 


Ser 


He 


Glu 



125 

Arg Ala Ile Ala

Trp Gly Glu Val
160

Arg Phe Ile Ile
175

Leu Asp Tyr Glu
190

Phe Ile His Lys
205

Lys Gly Ser Leu

Leu Tyr Trp Thr
240

Tyr Thr Gly Glu
255

Met Asp Ile His
270

Pro Cys Gly Ile
285

Pro Val Lys Pro

Leu Glu Asn Gly
320

Leu Ala Arg Arg
335

Phe Thr Asp Ile
350

Ile Asp Tyr Asp
365

Val Arg Ala Ile

Val Val Thr Ala 
400 

Val Ala Arg Asn 
415 

Val Thr Arg Leu 
430 

Leu Glu Glu Pro 
445 

Tyr Trp Thr Asp 

Asp Gly Ser Asp 
480 

Asn Gly Leu Ala 
495 

Ala Lys Thr Asp 
510 

Arg Val Leu Val 
525 

Leu Gly Asp Tyr 
Arg Val His Lys 






WO 02/092000 



PCT/US02/14877



545 550 555 560 

Arg Ser Ala Glu Arg Glu Val Ile Ile Asp Gln Leu Pro Asp Leu Met
565 570 575

Gly Leu Lys Ala Thr Asn Val His Arg Val Ile Gly Ser Asn Pro Cys
580 585 590

Ala Glu Glu Asn Gly Gly Cys Ser His Leu Cys Leu Tyr Arg Pro Gln
595 600 605

Gly Leu Arg Cys Ala Cys Pro Ile Gly Phe Glu Leu Ile Ser Asp Met
610 615 620

Lys Thr Cys Ile Val Pro Glu Ala Phe Leu Leu Phe Ser Arg Arg Ala
625 630 635 640

Asp Ile Arg Arg Ile Ser Leu Glu Thr Asn Asn Asn Asn Val Ala Ile
645 650 655

Pro Leu Thr Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Thr
660 665 670

Asp Asn Arg Ile Tyr Trp Thr Asp Ile Ser Leu Lys Thr Ile Ser Arg
675 680 685

Ala Phe Met Asn Gly Ser Ala Leu Glu His Val Val Glu Phe Gly Leu
690 695 700

Asp Tyr Pro Glu Gly Met Ala Val Asp Trp Leu Gly Lys Asn Leu Tyr
705 710 715 720

Trp Ala Asp Thr Gly Thr Asn Arg Ile Glu Val Ser Lys Leu Asp Gly
725 730 735

Gln His Arg Gln Val Leu Val Trp Lys Asp Leu Asp Ser Pro Arg Ala
740 745 750

Leu Ala Leu Asp Pro Ala Glu Gly Phe Met Tyr Trp Thr Glu Trp Gly
755 760 765

Gly Lys Pro Lys Ile Asp Arg Ala Ala Met Asp Gly Ser Glu Arg Thr
770 775 780

Thr Leu Val Pro Asn Val Gly Arg Ala Asn Gly Leu Thr Ile Asp Tyr
785 790 795 800

Ala Lys Arg Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Leu Ile Glu
805 810 815

Ser Ser Asn Met Leu Gly Leu Asn Arg Glu Val Ile Ala Asp Asp Leu
820 825 830

Pro His Pro Phe Gly Leu Thr Gln Tyr Gln Asp Tyr Ile Tyr Trp Thr
835 840 845

Asp Trp Ser Arg Arg Ser Ile Glu Arg Ala Asn Lys Thr Ser Gly Gln
850 855 860

Asn Arg Thr Ile Ile Gln Gly His Leu Asp Tyr Val Met Asp Ile Leu
865 870 875 880

Val Phe His Ser Ser Arg Gln Ser Gly Trp Asn Glu Cys Ala Ser Ser
885 890 895

Asn Gly His Cys Ser His Leu Cys Leu Ala Val Pro Val Gly Gly Phe
900 905 910

Val Cys Gly Cys Pro Ala His Tyr Ser Leu Asn Ala Asp Asn Arg Thr
915 920 925

Cys Ser Ala Pro Thr Thr Phe Leu Leu Phe Ser Gln Lys Ser Ala Ile
930 935 940

Asn Arg Met Val Ile Asp Glu Gln Gln Ser Pro Asp Ile Ile Leu Pro
945 950 955 960

Ile His Ser Leu Arg Asn Val Arg Ala Ile Asp Tyr Asp Pro Leu Asp
965 970 975

Lys Gln Leu Tyr Trp Ile Asp Ser Arg Gln Asn Met Ile Arg Lys Ala
980 985 990

Gln Glu Asp Gly Ser Gln Gly Phe Thr Val Val Val Ser Ser Val Pro
995 1000 1005

Ser Gln Asn Leu Glu Ile Gln Pro Tyr Asp Leu Ser Ile Asp Ile Tyr
1010 1015 1020

Ser Arg Tyr Ile Tyr Trp Thr Cys Glu Ala Thr Asn Val Ile Asn Val
1025 1030 1035 1040

Thr Arg Leu Asp Gly Arg Ser Val Gly Val Val Leu Lys Gly Glu Gln
1045 1050 1055

Asp Arg Pro Arg Ala Ile Val Val Asn Pro Glu Lys Gly Tyr Met Tyr
1060 1065 1070

Phe Thr Asn Leu Gln Glu Arg Ser Pro Lys Ile Glu Arg Ala Ala Leu
1075 1080 1085

Asp Gly Thr Glu Arg Glu Val Leu Phe Phe Ser Gly Leu Ser Lys Pro
1090 1095 1100

Ile Ala Leu Ala Leu Asp Ser Arg Leu Gly Lys Leu Phe Trp Ala Asp
1105 1110 1115 1120

Ser Asp Leu Arg Arg Ile Glu Ser Ser Asp Leu Ser Gly Ala Asn Arg
1125 1130 1135

Ile Val Leu Glu Asp Ser Asn Ile Leu Gln Pro Val Gly Leu Thr Val
1140 1145 1150

Phe Glu Asn Trp Leu Tyr Trp Ile Asp Lys Gln Gln Gln Met Ile Glu
1155 1160 1165

Lys Ile Asp Met Thr Gly Arg Glu Gly Arg Thr Lys Val Gln Ala Arg
1170 1175 1180

Ile Ala Gln Leu Ser Asp Ile His Ala Val Lys Glu Leu Asn Leu Gln
1185 1190 1195 1200

Glu Tyr Arg Gln His Pro Cys Ala Gln Asp Asn Gly Gly Cys Ser His
1205 1210 1215

Ile Cys Leu Val Lys Gly Asp Gly Thr Thr Arg Cys Ser Cys Pro Met
1220 1225 1230

His Leu Val Leu Leu Gln Asp Glu Leu Ser Cys Gly Glu Pro Pro Thr
1235 1240 1245

Cys Ser Pro Gln Gln Phe Thr Cys Phe Thr Gly Glu Ile Asp Cys Ile
1250 1255 1260

Pro Val Ala Trp Arg Cys Asp Gly Phe Thr Glu Cys Glu Asp His Ser
1265 1270 1275 1280

Asp Glu Leu Asn Cys Pro Val Cys Ser Glu Ser Gln Phe Gln Cys Ala
1285 1290 1295

Ser Gly Gln Cys Ile Asp Gly Ala Leu Arg Cys Asn Gly Asp Ala Asn
1300 1305 1310

Cys Gln Asp Lys Ser Asp Glu Lys Asn Cys Glu Val Leu Cys Leu Ile
1315 1320 1325

Asp Gln Phe Arg Cys Ala Asn Gly Gln Cys Ile Gly Lys His Lys Lys
1330 1335 1340

Cys Asp His Asn Val Asp Cys Ser Asp Lys Ser Asp Glu Leu Asp Cys
1345 1350 1355 1360

Tyr Pro Thr Glu Glu Pro Ala Pro Gln Ala Thr Asn Thr Val Gly Ser
1365 1370 1375

Val Ile Gly Val Ile Val Thr Ile Phe Val Ser Gly Thr Val Tyr Phe
1380 1385 1390

Ile Cys Gln Arg Met Leu Cys Pro Arg Met Lys Gly Asp Gly Glu Thr
1395 1400 1405

Met Thr Asn Asp Tyr Val Val His Gly Pro Ala Ser Val Pro Leu Gly
1410 1415 1420

Tyr Val Pro His Pro Ser Ser Leu Ser Gly Ser Leu Pro Gly Met Ser
1425 1430 1435 1440

Arg Gly Lys Ser Met Ile Ser Ser Leu Ser Ile Met Gly Gly Ser Ser
1445 1450 1455

Gly Pro Pro Tyr Asp Arg Ala His Val Thr Gly Ala Ser Ser Ser Ser
1460 1465 1470

Ser Ser Ser Thr Lys Gly Thr Tyr Phe Pro Ala Ile Leu Asn Pro Pro
1475 1480 1485

Pro Ser Pro Ala Thr Glu Arg Ser His Tyr Thr Met Glu Phe Gly Tyr
1490 1495 1500

Ser Ser Asn Ser Pro Ser Thr His Arg Ser Tyr Ser Tyr Arg Pro Tyr
1505 1510 1515 1520

Ser Tyr Arg His Phe Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val
1525 1530 1535

Cys Asp Ser Asp Tyr Ala Pro Ser Arg Arg Met Thr Ser Val Ala Thr
1540 1545 1550

Ala Lys Gly Tyr Thr Ser Asp Leu Asn Tyr Asp Ser Glu Pro Val Pro
1555 1560 1565

Pro Pro Pro Thr Pro Arg Ser Gln Tyr Leu Ser Ala Glu Glu Asn Tyr
1570 1575 1580

Glu Ser Cys Pro Pro Ser Pro Tyr Thr Glu Arg Ser Tyr Ser His His
1585 1590 1595 1600

Leu Tyr Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser
1605 1610

<210> 765 
<211> 207 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> cytoplasmic domain and portion of transmembrane 
domain 



<400> 765 






















His Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro
1 5 10 15

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe Ile Ala Pro
20 25 30

Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys Gly Lys Ser
35 40 45

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu
50 55 60

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser
65 70 75 80

Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro Pro Ser Pro
85 90 95

Ala Thr Asp Pro
Ser Ala Ser Arg Trp Lys Ala Ser
145 150 
Asp Ser Asp Pro Tyr Pro Pro Pro 